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Preface to the Second Edition 


Since the appearance in 1971 of the first edition, Chapters III, IV, 
and V of this book have served, with certain modifications, as the basis 
of a two-term course on functional analysis in quantum mechanics 
offered at the University of Toronto at the advanced undergraduate and 
first-year graduate level. The aforementioned modifications have now 
been incorporated in the present edition, together with additional 
material which brings the book up to date, and in some instances 
provides an introduction to topics of contemporary research interest. 

Practically all alterations occur in the second half of the book, and 
especially in the part dealing with quantum scattering theory. In 
addition to reorganized exposition, they consist of substantial simplifica¬ 
tions in the proofs of various theorems. Hence it is hoped that the text 
has now become in its entirety easy to follow for any student in mathe¬ 
matics and/or physics interested in the subject. Furthermore, a number 
of completely new sections have been included summarizing research 
results which have been published during the past decade and that 
directly pertain to the subject matter of this book. Since a detailed 
coverage of all the material contained in Chapters III-V would be 
unfeasible in a typical one-year course, asterisks have been liberally 
employed to indicate proofs of theorems, and even full sections, that 
can be skipped without making the subsequent material incompre¬ 
hensible. 

As was the case with the first edition, the goal of the present edition 
is not to supplant but rather to supplement any of the good textbooks 
on nonrelativistic quantum mechanics, which as a rule have a purely 
physics orientation. More precisely, the chief aim is to provide in a 
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Preface to the Second Edition 


readable and self-contained form, which would be accessible even to 
students without an extensive mathematical background, rigorous proofs 
of all the main statements of a functional-analytic nature that one 
encounters in standard textbooks on quantum mechanics. It should be 
emphasized that no attempt has been made to include any material 
dealing with complex analysis or with group theory. Indeed, in the case 
of complex analysis a reasonable level of mathematical rigor is maintained 
in many of the better physics textbooks on quantum scattering theory, 
whereas the subject of group theory in quantum mechanics has already 
received ample attention in the form of monographs and compilations 
of review articles on the subject at all conceivable levels of mathematical 
sophistication. 

Albeit a few monographs dealing in part with some of the subject 
matter of this book have also been published since the first edition made 
its initial appearance, as a rule they tend to lean heavily toward the 
purely mathematical aspects of the subject. It is therefore hoped that 
the present edition may still perform a useful function by filling the gap 
between purely physics and purely mathematics oriented textbooks and 
monographs pertaining to Hilbert space theory as applied to non- 
relativistic quantum mechanics. 


Toronto , 1981 


E. PrugoveCki 



Preface to the First Edition 


This book was developed from a fourth-year undergraduate course 
given at the University of Toronto to advanced undergraduate and 
first-year graduate students in physics and mathematics. It is intended 
to provide the inquisitive student with a critical presentation of the 
basic mathematics of nonrelativistic quantum mechanics at a level 
which meets the present standards of mathematical rigor. It should also 
be of interest to the mathematician working in functional analysis and 
related areas, who would like to see some of the applications of the basic 
theorems of functional analysis to quantum mechanics. 

With these aims in mind, I have tried to make the book self-contained. 
A knowledge of advanced calculus, linear algebra, the basics of ordinary 
linear differential equations, and the very basic concepts of classical 
mechanics (such as the notions of force, momentum, angular momentum 
energy, etc. of an n-particle system) provide an adequate background 
for the reader of this work. 

The material is organized in the form of definitions, theorems, and 
proofs of theorems. Such a format has the advantage of enabling the 
reader to grasp immediately the basic concepts and results, separating 
them from the sometimes tedious techniques. In order to facilitate the 
perusing of the book preceding a more thorough reading, marginal 
asterisks indicate those theorems whose proofs can be skipped at a first 
reading. 

I consider the material contained in the present book to stand in 
the same relation to quantum physics, as the material of conventional 
mathematical physics stands in relation to classical physics. Therefore, 
the prevailing style and approach is similiar to texts on mathematical 
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physics: the mathematical methods are in the foreground; the physical 
examples appear only as illustrations of these methods. Emphasis is 
placed on general concepts and basic techniques of proving theorems, 
rather than on computational methods. Consequently, the reader who is 
not already familiar with the material that can be found in a conventional 
book on quantum mechanics will find it extremely useful to take at least 
an occasional glance at any of the standard textbooks on quantum 
mechanics (some of which are listed in the References) which cover 
the physical aspects of quantum theory in more detail. 

Since the text is oriented toward physical application, I have given 
the minimal amount of mathematics necessary for a good understanding 
of the main mathematical aspects of nonrelativistic quantum mechanics. 
Consequently, basic material on measure theory and functional analysis 
has been introduced in a selective manner, always keeping in mind its 
application to quantum mechanics. However, I believe that the interested 
reader can easily expand his knowledge of the areas of mathematics 
introduced in this book by using the reference books and the present text 
as a starting point. The list of references given at the end of each chapter 
is meant to help him in this task. Furthermore, since the last two chapters 
contain mostly material which until now was published only in original 
papers but not in textbooks or monographs, I have supplemented these 
references with a few historical notes. However, many of the mathe¬ 
matical tools used in the present book have already become standard. 
Hence, no effort has been made to provide an exhaustive bibliography 
or to give credit to the original source, with the exception of names 
which are now customarily associated with certain theorems, formulas, 
equations, etc., and of references related to topics of current research 
interest. 

Most of the exercises form an integral part of the text. Consequently, 
though many of these exercises are relatively easy to solve, it has been 
deemed necessary to provide the solutions or detailed hints appearing 
at the end of the text. However, the reader is urged to try and solve 
these exercises by himself before looking up the solutions. 

The book should appeal to the increasing number of students in 
theoretical physics who desire to understand the justification of the 
multitude of heuristic procedures and shortcuts ordinarily employed 
in the typical books and courses on nonrelativistic quantum mechanics. 
Such procedures, which until a few decades ago had to be taken at face 
value, can be now rigorously justified, due to advances in functional 
analysis in general, and in the Hilbert space theory in particular. 

E. PrugoveCki 


Toronto , 1971 
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Introduction 


The beginning of the era of modern quantum mechanics is marked 
by the year 1925 and the two almost simultaneous papers of Heisenberg 
[1925] and Schroedinger [1926]. The first of these papers proposes 
essentially the formalism of matrix mechanics, while the second one 
proposes the formalism of wave mechanics. It was first indicated by 
Schroedinger that these two formulations are physically equivalent. 
Both can be embraced in a more general formulation of quantum 
mechanics, which was first proposed in a somewhat heuristic form by 
Dirac [1930]. A mathematically rigorous development of this general 
formalism of quantum mechanics can be achieved by taking advantage of 
Hilbert space theory, which is done in this book.* 

Though classical and quantum mechanics are very dissimilar in many 
respects, they do share a lot of common features when viewed from a 
general structural point of view. To find these common features, one 
must have some insight into the mechanism of a physical theory from an 
abstract point of view, at which one arrives by ignoring the technical 
details of the theory. 

One can dissect any physical theory into the following main consti¬ 
tuents: (1) formalism, (2) dynamical law, (3) correspondence rules. 

Viewed from an abstract point of view, the formalism consists of a set 
of symbols and rules of deduction. With these symbols one can build 
statements or propositions. The rules of deduction enable us to deduce 
new statements from those already given. 

Ideally, every scientific theory starts with a set of basic statements 


# For a detailed historical survey of the origins and development of quantum theory 
consult any of the many standard textbooks on quantum mechanics (e.g., Messiah [1962]). 
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called the axioms of the theory. However, in practice, while a theory is in 
the growing process, it very often happens that no set of axioms is 
clearly stated and the axiomatic formulation of the theory is left to the 
future. Nonrelativistic quantum mechanics has an accepted axiomatic 
formulation (given in Chapter IV) which covers all the present practical 
needs, though proposals for new sets of axioms are still making their 
appearance in scientific journals. These proposals have mainly the virtue 
of greater generality over the conventional Hilbert space formulation 
adopted here (so that they contain this formulation as a special case). 
However, from the practical point of view they have not yet produced 
any new results not derivable from the conventional approach. 

The two classes of basic objects in the formalism of both classical and 
quantum mechanics are the states and observables. We shall illustrate 
these concepts by a few examples. 

In the Newtonian approach to classical physics, the state of a system 
is given by the family of trajectories of all particles constituting the 
system. For instance, the state of one particle moving in three dimensions 
is given by a vector-valued function r(£), r(£) e M? (IR n denotes an 
n-dimensional real Euclidean space), in the time-parameter t. In the 
Hamiltonian (canonical) approach to classical mechanics, the state of 
a single particle is also a vector-valued function 

(?l(0. ?2(0. ?s(0. />l(0. p2(t), Ps(t)), 

defined now in a six-dimensional real Euclidean space called the phase 
space of the system. In the Schroedinger formulation of quantum 
mechanics, the state is again a vector-valued function t ), only this 
time the function assumes values from an infinite-dimensional complex 
Euclidean space which bears the name of Hilbert space. 

The observables of the formalism are symbols related to specific 
experimental procedures for measuring them. For instance, in Newtonian 
classical physics of a one-particle system, r(t) is the position of the particle 
at time t\ p(£) == mr(t) its momentum (r (t) — dr(t)ldt ); mr 2 (t)l2 its 
kinetic energy (m denotes the mass of the particle), etc. In the Hamiltonian 
approach ^(J), q 2 (t ), q z (t)> p^t), /> 2 (0> anc ^ Pzit) are > m general, related 
by some functional relation to the Cartesian position and momentum 
components of the particle. In quantum mechanics, observables are 
represented by so-called self-adjoint operators in the Hilbert space 

Every physical theory contains a particularly important ingredient 
called the dynamical law (frequently referred to as the “dynamics”). This 
dynamical law is, in general, a relation which some of the basic objects 
of the formalism must satisfy. It is the key component of the theory 
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because it gives the theory its predictive power. From a formal logical 
point of view, the dynamical law could be considered a part of the 
formalism. However, due to its central role in predicting the future 
behavior of the system from a knowledge of its present or past behavior, 
we prefer to single it out. 

From the mathematical point of view, the dynamical law is very 
frequently a differential equation which has to be satisfied by the state 
of the system, and in that case it is called the equation of motion . For 
instance, in Newtonian classical mechanics of a single particle of mass m 
moving under the influence of a force F, the equation of motion is given 
by Newton’s second postulate 

(0.1) «*(<)= F(f), r{t) = d ^f. 


In the canonical formalism of a single particle we have 


dqijt) = dlf 

dt dp k 9 


dp k (t) 3H 

dt dq k 9 


k = 1,2, 3, 


where H(q k , p k , t) is the Hamiltonian of the system. In the Schroedinger 
formulation of quantum mechanics, the equation of motion governs the 
time evolution of the state of the system, and is called the Schroedinger 
equation. However, in the Heisenberg formulation of quantum mechan¬ 
ics, the equation of motion, called the Heisenberg equation, does not 
involve the state (which in this case is time independent) but rather the 
observables of the system (see Chapter IV, §3). 

It must be realized that the dynamical law need not always be expressed 
by a differential equation. Very frequently, it is expressed by an integral 
equation, sometimes derivable from a differential equation. In modern 
quantum scattering theory, the dynamical law is given by analyticity 
conditions that certain key functions (the so-called *S-matrix elements) 
appearing in the theory have to satisfy. 

The correspondence rules are the rules which assign empirical meaning 
to some of the symbols appearing in the formalism. As such, they provide 
the link between theory and experiment. The body of all correspondence 
rules of a theory is in physics better known under the name of the 
physical interpretation of the theory. 

In classical mechanics, which has a very direct intuitive appeal, the 
correspondence rules are very straightforward, and they leave no room 
for plausible alternatives. For instance, in Newtonian one-particle 
mechanics, it is well known that for a particle in the state r(t), t e R 1 , 
the components of the vectors r (t), mf(t) f etc. represent the length of the 
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projections of the position, momentum, etc. vectors on the three axes 
of an inertial frame* with Cartesian coordinates. 

In the case of quantum mechanics, which does not yield itself to 
visualization as readily as classical mechanics, the matter of physical 
interpretation is much more complex and therefore open to discussion. 
In this case the desirability and empirical consistency of the entire body 
of correspondence rules is still open to scrutiny and critical appraisal, 
being treated in its own right in the theory of measurement of quantum 
mechanics. However, the main features of the now widely accepted 
interpretation, sometimes called the Copenhagen school interpretation 
(see also Chapter IV, §1), are quite clear. The most striking of these 
features is that this interpretation is essentially statistical; it does not 
provide us with definite statements about the future behavior of a 
quantum mechanical system, but rather with probabilistic statements 
about the likelihood of different patterns of behavior. It should be men¬ 
tioned that this feature has given rise to much controversy as to whether 
quantum mechanics provides the ultimate tool in describing the behavior 
of atomic and subatomic systems. The ranks of physicists who have 
adopted a critical attitude when considering this subject include notably 
the names of some of the pioneers in the field, such as Einstein, 
Schroedinger, + and de Broglie. 

Part of the physical interpretation of the theory consists in giving 
correspondence rules which assign to observables Experimental proce¬ 
dures for measuring them. From the mathematical point of view, these 
observables are represented (in the conventional formalism adopted here) 
by self-adjoint operators in a Hilbert space. These operators will be 
studied from a general point of view in Chapter III. At present we shall 
describe some experimental procedures for measuring, in principle, * 
the most common of observables: those of position, momentum, and 
spin of an object of atomic or subatomic size (we shall call such an object 


* We remind the reader that an inertial frame is a physical body (usually the laboratory 
of the experimenter) in which Newton’s equation of motion is valid for macroscopic bodies 
moving at speeds which are low by comparison with the speed of light. It is an empirical 
fact that a frame of reference moving during the duration of the experiment at uniform 
speed with respect to the sun is a very good approximation to an inertial frame. 

+ It is interesting to mention that one of the earlier physical interpretations of the forma¬ 
lism of wave mechanics (which constitutes a special case of the Hilbert space formalism of 
quantum mechanics) was proposed by Schroedinger, was in total disagreement with the 
Copenhagen school interpretation, and lacked any statistical features. 

* The experimental procedures for measuring these observables “in principle” may 
be conceptually very simple, but from the purely technical point of view they may not be 
the moat efficient or the easiest to carry out with a high degree of accuracy and at a given 
cost. 
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a microparticle, so as not to confuse it with the more special term, 
elementary particle). 

Measurements for determining the position of a particle can be carried 
out by a number of detectors (Wilson chambers, bubble chambers, 
Geiger-Muller counters, etc.). These detectors signal the presence of a 
particle within the volume enclosed by the detector by a directly notice¬ 
able (macroscopic) change of their state. Conceptually the simplest of 
such detectors is a photographic plate on which a landing microparticle 
leaves a characteristic mark. 

The essential parts of an apparatus for determining the momentum of 
a charged particle are depicted in Fig. 1. If particles of known charge e 



Fig. 1. Experimental arrangement for determinative measurement of the momentum 
of a charged microparticle. 

interact to the left of the screen* S x , and if a particle is detected by the 
detector depicted in Fig. 1 (which can be any of the types mentioned, 
or some other type best suited to the needs of a particular experiment), 
then the determined momentum p of particle is of magnitude (in suitable 
units) 

(0.2) IP I =eHr 09 

and in the direction indicated in Fig. 1; in the above formula, H is the 
strength of a uniform magnetic field H, which is present to the right of 
the screen S 2 and orthogonal to the plane of drawing. 

# Both screens S x and S 2 should be impenetrable to all the particles used in the 
experiment. This impenetrability can be established by closing the apertures in the 
screens and checking that a detector placed to the right of each screen does not detect 
anything except the “background noise,” which has a definite pattern and is due to the 
everpresent cosmic rays. 



6 


Introduction 


It must be stressed that the above-mentioned experimental arrange¬ 
ments are the ones which give an empirical meaning to the concepts of 
position and momentum of a microscopic particle. In other words, it is 
not meaningful to ask the question: “How do we know that the macro¬ 
scopic change of state taking place in a detector (the black mark on a 
photoplate, or the track of condensed vapor in a Wilson chamber) is 
“in reality” due to a particle, and that the momentum of that particle 
is indeed given by Eq. (0.2) ?” Such a question is empirically meaningless 
because, in the ultimate analysis (see Heisenberg [1925, p. 174]), the 
only knowledge that we possess about the microscopic world is the one 
we can extract from macroscopic phenomena. 

The above type of measurement determines the value of observables 
at the instant the measurement is carried out. More precisely, from such 
a measurement one can compute, within bounds of accuracy inherent in 
the particular apparatus, what would be the value of a particular observ¬ 
able if there were no disturbances due to the interaction of the system 
(which in the above cases is the microparticle) and the apparatus. This 
type of measurement will be called a determinative measurement. 

A determinative measurement does not tell us, in general, what the 
value of the measured observable, or even what the fate of the system 
will be after the measurement. For example, the system might be com¬ 
pletely destroyed or cease to be an independent entity (for instance, 
when the detector is a photoplate and the particle is absorbed by it) after 
the measurement. In general, the measurement process will disturb the 
system or even change its nature (particle “creation and annihilation”), 
and such a disturbance must be taken into account if the measurement 
is used as a preparation of a system with a certain value of the measured 
observable. This phenomenon is characteristic of microphysics and 
cannot be ignored, as is done in macrophysics; in principle it exists also 
in macrophysics, but from the practical point of view it is completely 
negligible. To realize this, one should recall that measurements in 
classical mechanics require “seeing” the system, i.e., the reflection or 
emission of light from the objects constituting the system. Now, light 
(i.e., photons) has a certain momentum, which is imparted to the objects 
on which it impinges or by which it is emitted, but that momentum is 
completely negligible when dealing with objects of macroscopic size. 

Thus, in quantum mechanics it is important to distinguish between deter¬ 
minative measurements of the above type and preparatory measurements.* 

# It is more common to call a determinative measurement simply a “measurement,” 
and a preparatory one a “preparation of state.” We prefer to avoid the term “preparation 
of state,” since it suggests that such a measurement will prepare a quantum mechanical 
state , which is by no means always true. 
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SOURCE 

OF 

PARTICLES 



Fig. 2. Experimental arrangement for a preparatory measurement of the momentum 
of a charged microparticle. 


For instance, in the modification in Fig. 2 of the apparatus in Fig. 1, 
we have a device which is meant to prepare charged particles of a given 
momentum. If we keep the shutter open for a very short 4 ‘instant”* 
from t Q to £ 0 -j- At, then any particle that might have emerged during 
this period to the right of the screen S (i.e., in the interaction region, where 
it would interact with other particles adequately prepared) would have a 
momentum of magnitude | p | given by the formula (0.2), and in the 
direction indicated on Fig. 2. 

A striking feature of the above experimental procedure is that it does 
not tell us, by itself, whether there was indeed a particle which had 
passed through the aperture while the shutter was open. This feature is 
common to many preparatory measurements. It is in the determinative 
(later) stage of the experiment that the presence or absence of a particle 
is established/ 

We note that the above preparatory procedure cannot prepare a sharp 
value p of the momentum, but rather, due to the finite size of the apertures 
Oi, 0 2 , and 0 3 , it prepares a whole range A C U 3 of momenta. When 
the size of the last aperture 0 3 is such that so-called “diffraction effects” 
can be neglected, this prepared range A consists of all the momenta that 
a particle of charge e would possess when traveling along all the imagin¬ 
able paths that such a particle would have to follow according to classical 
mechanics in order to pass through the apertures O x , 0 2 , and 0 3 . 

The above-mentioned diffraction effects are again characteristic of 

# By a very short “instant” is meant a time interval of duration At which is negligible 
in comparison with the other errors of measurement occurring in the particular experiment 
in which the described preparatory measurement is included. 

+ We could, of course, add a detector to the above apparatus, but such a detector 
would significantly disturb the prepared value of the momentum. 
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microphysics, and constitute the so-called “wave nature” of micro¬ 
particles. They correspond to the experimental observation that if a 
detector (e.g., a photoplate) is placed behind two parallel impenetrable 
screens each having one aperture, a “particle” might be detected not only 
along any straight line passing through the two apertures, but also at other 
points. If we imagine that a microparticle travels along a trajectory, then 
it would seem that the trajectory is in such cases bent after the particle has 
passed through the aperture. If we have a beam of particles (i.e., many 
independent particles), then the net effect which will be observed in a 
photoplate placed behind the screens is a diffraction pattern, qualitatively 
very similar to the diffraction pattern of a beam of light. 

We note that the experimental arrangement in Fig. 2 prepares not 
only a range A of momenta, but also a range A' of the position observ¬ 
ables, where A' is the region enclosed by the aperture 0 3 . However, 
while classical mechanics assumes that the linear dimensions of A and A' 
could be made arbitrarily small by building sufficiently precise apparatus, 
it is a direct consequence of the diffraction effects that this is not the case 
with microparticles. This experimental finding is embodied in 
Heisenberg’s uncertainty principle which states that no apparatus can be 
built which would prepare a particle to have the x coordinate (in some 
Cartesian frame of reference) within the interval A x and the ^-momen¬ 
tum component within the interval A p , so that the inequality 

(0.3) \A X \\A V J^ h/lir 

is not satisfied, where | A x | and | A p ^ | denote the lengths of the respective 
intervals, and h is a universal constant called the Planck constant; 
similar relations 

>A/2ir, >A/2ir, 

hold for the^y and z Cartesian components. 

It should be mentioned that the experimental arrangements in 
Figs. 1 and 2 determine and prepare, respectively, also the energy of 
the microparticle on which the measurement is carried out. By definition, 
if the momentum of a free particle of mass* m is p, then its energy is 

E = p 2 /2m. 

# The mass of a particle can be measured by the experimental arrangement in Fig. 1 if 
we introduce an electric field E orthogonal to the existing magnetic field H. Quantities 
like mass, charge, and intrinsic spin are characteristic of each kind of microparticle, and, 
in fact, provide the only means of differentiating types of microparticles, like electrons, 
positrons, neutrons, protons, hydrogen atoms, water molecules, etc. 
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We turn now to measurements of spin. Assume that we have in 
classical mechanics a body which rotates, with respect to an inertial 
frame of reference, around a certain axis passing through it, which is 
stationary or moves at a uniform speed with respect to that frame of 
reference. Such a body has then an angular momentum s with respect 
to that axis. If the dimensions of the body are negligible with respect to 
other dimensions in the experiment, so that the body can be called a 
particle, then s is called the spin of the particle. If the particle carries a 
charge, then due to its spin it will have a magnetic (dipole) moment p 
proportional to s. When such a particle travels through an area in which 
there is an inhomogeneous magnetic field H, this field will act on it with 
a force 

F = grad(p • H). 

In the light of the above remarks, each one of the particles of equal 
velocity (i.e., equal momentum) and equal spin in a beam passing through 
the experimental arrangement depicted in Fig. 3 would be deflected 



Fig. 3. The Stern-Gerlach experiment. 

from a straight path by an amount proportional to the projection (i z of 
the magnetic moment p of that particle in the direction of the gradient H' 
of the field H. If the spins of the particles were randomly oriented in 
all directions (as we could expect them to be if the source were a gas of 
such particles), then one would predict on the basis of the above con¬ 
siderations a continuous distribution of marks on the photoplate. How¬ 
ever, the Stern-Gerlach experiment carried out with molecular and 
atomic beams reveals instead, in general, n discrete lines (in Fig. 3, 
n = 2)—a phenomenon which is usually described by saying that the 
spin is quantized. 




10 


Introduction 


In quantum mechanics, by definition, the spin of the above particles 
is taken to be 


s 


n — 1 


2 


(though, strictly speaking, for reasons to become clear when a systematic 
theoretical study of the spin is undertaken, the total spin is [$($ + l)] 1 / 2 ). 
Thus, s can assume only integer and half-integer values J, 1, f, 2,...; 
in Fig. 3 we have depicted the case of spin 

The experimental arrangement of Stern and Gerlach can be used as 
an apparatus for a determinative measurement of the spin component 
in the direction H.' In that case the source of particles would originate in 
the interaction region, where particles of known spin are interacting. 
If a particle of integer spin* s leaves a mark at O, then, by definition, it has 
spin zero in the H'direction; the first, second,..., (n — l)/2 mark above O 
correspond to spin components in the IT direction equal to 1, 2,..., 
(n — l)/2, respectively, while the first, second,..., (n — l)/2 marks below 
O correspond to spin components —1, —2,..., —(n — l)/2, respectively. 
In case of a particle of half-integer spin, there will be no middle mark; 
the first, second,..., (n — l)/2 marks above or below O correspond to spin 
components £, f,..., (n — l)/2, or — — (n — l)/2, respectively. 

Hence we see that according to the very definition of the spin projection 
onto a certain axis, that projection can assume only integer values in case 
of integer-spin particles, and only half-integer values in case of half¬ 
integer-spin particles. 

The above experimental arrangement can be easily transformed into 
an apparatus for preparatory measurements of spin by replacing the 
photoplate with a screen which has apertures at the spots where a beam 
of particles from the given source had left tracks. It has to be mentioned 
that no simultaneous measurements of spin in two different directions can 
be carried out on microparticles—a feature which is in complete agree¬ 
ment with certain properties (noncommutativity of spin-component 
operators) of the formalism of quantum mechanics. 

Here we end this short survey of some of the experimental procedures 
for measuring some of the basic observables which occur in quantum 
mechanics, and which will frequently appear in the pages of this book. 
In Chapter I we start our systematic study of the Hilbert space formalism 
of quantum mechanics, and related mathematics. 


# This means that a beam of such particle with random-oriented spins would have 
2s + 1 tracks on the photoplate. 
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Basic Ideas of Hilbert Space Theory 


The central object of study in this chapter is the infinite-dimensional 
Hilbert space. The main goal is to give a rigorous analysis of the problem 
of expanding a vector in a Hilbert space in terms of an orthogonal basis 
containing a countable infinity of vectors. 

We first review in §1 a few key theorems on vector spaces in general, 
and in §2 we investigate the basic properties of vector spaces on 
which an inner product is defined. In order to define convergence in an 
inner-product space, we introduce in §3 the concept of metric. In §4 we 
give the basic concepts and theorems on separable Hilbert spaces, con¬ 
centrating especially on properties of orthonormal bases. We conclude the 
chapter by illustrating some of the physical applications of these mathe¬ 
matical results with the initial-value problem in wave mechanics. 


1 ♦ Vector Spaces 


1.1. Vector Spaces over Fields of Scalars 

A mathematical space is in general a set endowed with some given 
structure. Such a structure can be given, for instance, by means of certain 
operations which are defined on the elements of that set. These operations 
are then required to obey certain general rules, which are called the 
postulates or the axioms of the mathematical space. 

Definition 1«1* Any set 'V' on which the operations of vector 
addition and multiplication by a scalar are defined is said to be a vector 
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space (or linear space , or linear manifold ). The operation of vector addition 
is a mapping, * 

V'g)-+f + g' if,g)e'Txr, f + gerr, 

of Y X Y into Y, while the operation of multiplication by a scalar a 
from a field 1 F is a mapping 

(aj)^afi (aJ)eFxY, afeY, 

of Fxf into Y. These two vector operations are required to satisfy 
the following axioms for any/, g,heY and any scalars a, b e F : 

(1) / + £—£+/ (commutativity of vector addition). 

(2) (/ + g) + h — f + (g + h) (associativity of vector addition). 

(3) There is a vector 0, called the zero vector , such that g satisfies 
the relation/ + g = / if and only if £= 0. 

(4) a(f + g) = af + ag. 

(5) (a + b)f=af+bf. 

( 6 ) (ab)f=a(bf). 

(7) 1/ = f y where 1 denotes the unit element in the field. 

By following a tacit convention, we denote a mathematical space 
constructed from a set S by the same letter S , except where ambiguities 
might arise. Thus, we shall denote by Y the vector space consisting of a 
set Y together with the vector operations on Y also by Y. 

When in a vector space the multiplication by a scalar is defined for 
scalars which are elements of the field F , we say that we are dealing with 
a vector space over the field F. If the field F is the field of real or complex 
numbers the vector space is called, respectively, a real or a complex vector 
space. 


* We remind the reader that a mapping M of a set into a set T is any unambiguous 
rule assigning to each element £ of *5 a single element M(|) of T; M(£) is called the image 
of £ under the mapping M. The set £ is the domain of definition of M, while the subset 
7\ C T of all image points M(£), T x — (17 = M(£): I e S}> is the range of M. If 7\ = T, 
then we say that M is a mapping of the set *5 onto the set T. 

If S n are sets, then X ••• X S n denotes the family tj n ) of all n -tuples 

of elements £ n e S n , and is called the Cartesian product of the sets S x S n . 

f A field is a set on which field operations of summation and multiplication are defined, 
i.e., operations satisfying certain axioms. We do not give these axioms because in the 
sequel we are interested only in two special well-known fields: the field of real numbers IR 1 
and the field of complex numbers C l consisting, respectively, of the set of real numbers IR 1 
and the set of complex numbers C 1 on which the field operations are ordinary summation 
and multiplication of numbers (see Bi rich off and MacLano 11953]). 
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As an example (see also Exercises 1.1, 1.2, and 1.3) of a real vector 
space consider the family ((R n ) of one-column real matrices and define for 

/«A /V\ 


vector summation by the mapping 


( 1 . 1 ) 


(“, P) “ + P = 


/«! + *l\ 


\a n + b n ) 

and for any scalar a e U 1 define multiplication of a by a as the mapping 


aaA 

( 1 . 2 ) ( a , a) !-► aoc = I j 1 . 

\aaj 

It is easy to check that Axioms 1-7 in Definition 1.1 are satisfied. 

Analogously we can define the complex vector space (C n ) by intro¬ 
ducing in the set C n of one-column matrices vector operations defined 
by the mapping (1.1) and (1.2), where now a, e C n , and therefore 
a i >•••> a n > >•••> b n , as well as the scalar a , are complex numbers. 


1.2. Linear Independence of Vectors 

Theorem LL Each vector space 'V' has only one zero vector 0, 
and each element/of a vector space has one and only one inverse (—/). 
For any f e i^, 

0/ = 0, (-!)/=(-/). 

Proof. If there are two zero vectors 0 X and 0 2 , they both have to 
satisfy Axiom 3 in Definition 1.1, 

/ = / + 0 1 = / + 0 2 

for all /. Hence, by taking / — 0 X we get + 0 2 , and then by 

taking / = 0 2 we deduce that 0 2 = 0 2 + 0 X = 0 1 + 0 2 = 0 X . Now 

/= 1 / = (1 + 0 )/ = 1 / + 0 / = / + 0 / 

and therefore 0/ = 0. We have 
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which proves the existence of an inverse (—/) = (— 1 )/ for /. This 
inverse (—/) is unique, because if there is another A e Y such that 
/ + A = 0 , we have 

(-/) - (-/) + o = (-/)+(/ +A) = [(-/) +/] +A 

= 0 + A = A. Q.E.D. 

Definition 1*2* The vectors f ly —,f n are said to be linearly inde¬ 
pendent if the relation 

c lfl + *" + c nfn — C 1 VJ c n e F 9 

has c x — * * • = c n = 0 as the only solution. A subset S (finite or infinite) 
of a vector space Y is called a set of linearly independent vectors if any 
finite number of different vectors from S are linearly independent. The 
dimension of a vector space Y is the least upper bound (which can be 
finite or positive infinite) of the set of all integers v for which there are v 
linearly independent vectors in 'V'. 

1.3. Dimension of a Vector Space 

When the maximal number of linearly independent vectors in the 
vector space 'V' is finite and equal to n, then by the above definition 'V' 
is n dimensional; otherwise the dimension of 'V' is + °°> an d ^ is said to 
be infinite dimensional. 

Theorem 1 * 2 * If the vector space Y' is n dimensional (n < +°o)> 
then there is at least one set A ,..., f n of linearly independent vectors, and 
each vector / e Y can be expanded in the form 

(1-3) /= «l/l + +“nfn, 

where the coefficients a 1 ,..., a n (which are scalars) are uniquely deter¬ 
mined by A 

Proof. If / = 0, (1.3) is established by taking a x — ••• = a n — 0. 
For / 7 ^ 0, the equation 

(1-4) cf + c 1 f 1 + ••• + c n f n — 0 

should have a solution with c ^ 0 due to the assumption that A v>/n 
are linearly independent, while A A >•••>/«. have to be linearly dependent 
because Y is n dimensional. From (1.4) we get 

/ = ( -ClM/l I ••• + ( c„lc)f n , 
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which establishes (1.3). If we also had 

(1*5) / = + ... + b n f n , 

then by subtracting (1.5) from (1.3) we get 

( a i &x)/x + ••• + ( a n — b n )f n = 0 . 

As /x ,.••>/% are linearly independent we deduce that a 1 —-b 1 = 0,..., 
a n — b n — 0, thus proving that a x a n are uniquely determined 
when/is given. Q.E.D. 

Definition 13* We say that the (finite or infinite) set S spans the 
vector space 'V' if every vector in 'V' can be written as a linear combination 

/ = aji x + + a n h n , h x ,..., h n e S 

of a finite number of vectors belonging to S'; if S' is in addition a set of 
linearly independent vectors, then S is called a vector basis of 'V'. 

Theorem 13* If the set {^x,..., £ m } is a vector basis of the w-dimen- 
sional (n < -\-co) vector space then necessarily m — n. 

Proof . As 'V' is n-dimensional, there must be n linearly independent 
vectors f x ,...,/ n . If the set {^x ,..., £ m } is a vector basis in 'V'^ we can write 


( 1 . 6 ) 


fl ~ a ugl + ” * + a mlgm 


fn &lngl “I - ’ ” “I - G"mngm • 


'Thus, if we try to satisfy the equation 

(1.7) x x f i + • • • + x n f n — 0, 

we get by substituting f x ,...,/ n in (1.7) with the expressions in (1.6) 

(1.8) (a n x i + ••• + v«kx + *•• + («mi*i + *•• + a mn x n )g m = 0. 

Kince g x ,..., g m are assumed to be linearly independent, the above equation 
I ms a solution in x 1 ,..., x n if and only if 


(1.9) 


#ii#i h “I - a ln x n — 0 

a ml x l “I - ’ ” a mn x n ^ 0. 


However, as/x are a lso linearly independent, (1.7) or equivalently 

(1.8) or (1.9) should have as the only solution the trivial one x x = ••• = 
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x n = 0. Now, m < n because Y is n dimensional and g 1 ,...,£ w are 
linearly independent (see Definition 1.2); therefore, (1.9) has only a 
trivial solution if and only if m = n. Q.E.D. 

Definition 1.4. A subset Y x of a vector space Y is a vector subspace 
(linear subspace) of Y if it is closed under the vector operations, i.e., if 
f+g e Y x and afe Y x whenever/, g e Y x and for any scalar a. A vector 
subspace Y x of Y is said to be nontrivial if it is different from Y and 
from the set {0}. 

From the very definition of the dimension of a vector space Y we can 
conclude that the dimension of a vector subspace Y x of Y cannot exceed 
the dimension of Y . 

1.4. Isomorphism of Vector Spaces 

Definition 1.5. Two vector spaces Y 1 and Y 2 over the same field 
are isomorphic if there is a one-to-one mapping Y x onto Y 2 which has the 
properties that if / 2 and g 2 , / 2 , g 2 e Y 2 , are the images of f x and g x , 
fi > £i G > respectively, then for any scalar a, af 2 is the image of af x 

afi <-> «/ 2 > 

and/ 2 + g 2 is the image of f x + g x 

fi +gi<->f2+g*- 

The importance of the isomorphism of two vector spaces Y x and Y 2 
lies in the obvious fact that two such spaces have an identical vector 
structure. It is easy to see that the relation of isomorphism is transitive 
(see Exercise 1.6), i.e., if Y x and Y 2 as well as Y 2 and Y^ are isomorphic, 
then Y x and Y% are also isomorphic. 

Theorem 1.4. All complex (real) n-dimensional (n < +oo) vector 
spaces are isomorphic to the vector space (C n ) [(U n ) in case of real vector 
spaces]. 

Proof. Consider the case of an n-dimensional vector space Y. 
According to Theorem 1.2 there is a vector basis consisting of n vectors 
fi v>/n > an d eac h vector feY can be expanded in the form (1.3), 
where a 1 ,..., a n e C 1 are uniquely determined by/. Consequently 
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is a mapping of Y into ( C n ). Furthermore, this is a one-to-one mapping 
of Y onto (C n ) because to any 



corresponds a unique / = b x f x + * “ + b n f n such that j8 = oc f . It is also 
easy to see that 

f + g\-> oc f+g = a, + , 

= «*/ • 

Since isomorphism of vector spaces is a transitive relation (see Exercise 
1 .6) we can conclude that all n-dimensional complex vector spaces are 
mutually isomorphic, because each of them is isomorphic to (C n ). Q.E.D. 

Exercises 

1.1. Check that the set of all m X n complex matrices constitutes an 
m • n dimensional complex vector space if vector addition is defined as 
being addition of matrices, and multiplication by a scalar is multiplication 
of a matrix by a complex number. 

1*2* Show that the set C 1 of all complex numbers becomes a two- 
dimensional real vector space if vector addition is identical to addition of 
complex numbers, and multiplication by a scalar is multiplication of a 
complex number (the vector) by a real number (the scalar). 

1.3* Show that the family ^(R 1 ) of all complex-valued continuous 
functions defined on the real line is an infinite-dimensional vector space 
if the vector sum/ + g off(x), g(x) e ^(R 1 ) is the function (/ + g)(x) — 
f(x ) + g(x), and the product af of f(x) e ^(R 1 ) with a e C 1 is the 
function ( af)(x) = af(x). The zero vector is taken to be the function 
f(x) EEE 0. 

1.4. Prove that if JT is a family of linear subspaces L of a vector 
space Y'y then their set intersection f) LeJ fL is also a vector subspace of Y'. 

1.5* Show that if S is any subset of a vector space Y, then there is 
a unique smallest vector subspace Y s containing S (called the vector 
Bubspace spanned by S). 

1.6* Verify that the relation of isomorphism of vector spaces is: 

(a) reflexive, i.e., every vector space Y is isomorphic to itself; 

(b) symmetric, i.e., if Y x is isomorphic to Y^ , then Y^ is isomorphic 
to Y x \ 
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(c) transitive, i.e., if ^ is isomorphic to and ^ is isomorphic 
to , then ^ is isomorphic to . 

1.7* Prove that the following subsets of the set ^(U 1 ) (see Exercise 
1.3) are vector subspaces of the vector space ^°( R x ): 

(a) the set of all polynomials with complex coefficients; 

(b) the set 3P n of all polynomials of at most degree n. 

Show that D 3P n . 

2. Euclidean (Pre-Hilbert) Spaces 

2.1. Inner Products on Vector Spaces 

A Euclidean (or pre-Hilbert or inner product or unitary) space S is a 
vector space on which an inner product is defined. The Euclidean 
space is called real or complex if the vector space on which the inner 
product is defined is, respectively, real or complex. 

Definition 2.1. An inner (or scalar ) product <• | •> on the complex 
vector space 'V' is a mapping of the set 'V' X 'V' into the set C 1 of complex 
numbers 

(/, g)»<f I g\ (/, g)e'Tx'r, <f\g>e C\ 

which satisfies the following requirements: 

(1) </!/)> 0, for all /* 0, 

(2) <J\g'> = <g\f>*, 

(3) <J\ag> = a<J\g->, aeC\ 

(4) </l^ + A> = </l^> + </|A>. 

Note that by inserting f — g = h — Oin Point 4 we get <0 | 0) — 0. 

Following a notation first introduced by Dirac [1930] and widely 
adopted by physicists, we denote the inner product of /and g by </ | £>. 
Mathematicians often prefer the notation (/, g) and replace Point 3 in 
Definition 2.1 by 

W,g) = «(/,£)• 

The above definition can be easily specialized to real vector spaces, 
in which case the inner product </1 £> is a real number, and Point 2 of 
Definition 2.1 becomes </|£> — <£ |/>. As in quantum physics we 
deal almost exclusively with complex Euclidean spaces, we limit ourselves 
from now on to the complex case. Consequently, if not otherwise stated, 
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whenever we talk about a Euclidean space, we shall mean a complex 
Euclidean space. 

Theorem 2*1 ♦ In a Euclidean space <f, the inner product </|£> 
satisfies the relations 

(a) <af\g} = a*(f\g), 

(b) <f + g\h > = </|A> + <*|A>. 

The proof is obtained by a straightforward application of Points 1-4 
in Definition 2.1: 

0/l£> = <£ I «/>* = I/)]* = «*<£ l/>* = «*</ I g>, 

<f + g\ h> = <h \f + g>* = [<A |/> + <A I £>]* = <A |/>* + <A | £>* 

= </1 h> + <g I A>. 

As an example of a finite-dimensional Euclidean space, we can take 
the vector space (C n ) defined in the preceding section, in which we 
introduce as the inner product of the vectors a and /? with the Ath 
components a k and b k , 

O | py = a !*&! 4- a 2 *b 2 4- ••• 4- a n *b n . 

It is easy to check that the above mapping of (C n ) X (C n ) into C 1 satisfies 
the four requirements of Definition 2.1. We shall denote the above 
Euclidean space with the symbol l 2 (n). 

An example of an infinite-dimensional Euclidean space is provided by 
the vector space of all continuous complex-valued functions 

/ (#) on the real line which satisfy 

/»+°° 

(2.1) \f(x)\ 2 dx < 4~oo, hm f(x) = 0, 

j _0O «-»±QD 

in which the inner product (see Exercise 2.1) is 

/*+°° 

(2.2) </!*>= | f*(x)g(x)dx. 

J —oo 

Theorem 2*2* Any two elements/, g of a Euclidean space $ satisfy 
the Schwarz-Cauchy inequality 

\<f\g>\ 2 <<f\fXg\g>- 

Proof. For any given /, £ e and any complex number a we have, 
from property 1 in Definition 2.1 and the comment following it, 


</ I 1/ I «£> > 0. 
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In particular, if we take in the above inequality 


a 


= A 


<f\g>* 

\<f\g>\ 


A = A*, 


we easily show that the inequality 


*(A) = A*<* \g> + 2A |</1 *>| + </1/> > 0 

is true for all real values of A. A necessary and sufficient condition that 
5(A) ^ 0 is that the discriminant of the quadratic polynomial g(X) is 
not positive 

l</ \g>\ 2 — <f\fXg\g><0, 

from which the Schwarz -Cauchy inequality follows immediately. Q.E.D. 


2.2. The Concept of Norm 

The family of all Euclidean spaces is obviously contained in the family 
of vector spaces. There is another family of vector spaces with special 
properties which is of great importance in mathematics: the family of 
normed spaces. 

Definition 2.2. A mapping 

/-ll/ll, fer, ll/ll eR\ 

of a complex vector space 'V' into the set of real numbers is called a norm 
if it satisfies the following conditions: 

(1) ||/|| >0 for 0, 

( 2 ) 11011 = 0 , 

(3) ||«/|| = Mll/ll for all a e C 1 , 

(4) 11/ + g II < II /II + II g II (the triangle inequality). 

We denote the above norm by || * ||. 

For a real vector space, we require in Point 3 that a e (R l . 

The last requirement in Definition 2.2 is known as the triangle inequal¬ 
ity because it represents in a two- or three-dimensional real vector 
space a relation satisfied by the sides of a triangle formed by three vectors 
/> g and / + g. 

A real (complex) vector space on which a particular norm is given is 
called a real (complex) normed vector space. A Euclidean space is a 
special case of a normed space; this can be seen from the following 
theorem. 
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Theorem 23 ♦ In a Euclidean space S with the inner product 
</ | g} the real-valued function 

(2-3) ||/|| - 

is a norm. 

Proof. The only one of the four properties of a norm which is not 
satisfied by (2.3) in an evident way is the triangle inequality. We easily get 

11/ + g\\ 2 = </ +g\f + g> = </ +/> + </ I g> + <g I/> + <g\g> 

(2.4) = </1 /> + 2Re</1 £> + <£ | £>• 

From the Schwarz-Cauchy inequality we have 

I Re</ |^>| < \(f \g}\ <||/||||^|| 5 

which when inserted in (2.4) yields 

ll/ + £ll 2 < ll/ll 2 + 21|/|| Ik II + lkll 2 = (ll/ll + 11^ II) 2 . 

The above relation leads immediately to the triangle inequality. Q.E.D. 

2.3. Orthogonal Vectors and Orthonormal Bases 

Some elementary geometrical concepts valid for real two- or three- 
dimensional Euclidean spaces can be generalized in a straightforward 
manner to any Euclidean space. 

Definition 23 ♦ In a Euclidean space S two vectors / and g are 
called orthogonal, symbolically / _]_ g; if </ \ g} — 0. Two subsets R 
and S of $ are said to be orthogonal (symbolically, R _|_ S) if each vector 
in R is orthogonal to each vector in S. A set of vectors in which any two 
vectors are orthogonal is called an orthogonal system of vectors. A vector / 
is said to be normalized if ||/|| = 1. An orthogonal system of vectors is 
called an orthonormal system if each vector in the system is normalized. 

Theorem 2*4* If S is a finite or countably infinite set of vectors in 
a Euclidean space S' and (S') is the vector subspace of S spanned by S , 
then there is an orthonormal system T of vectors which spans (S'), i.e., 
for which ( T) = (5); T is a finite set when S is a finite set. 

Proof. As the set S is at most countable we can write it in the form 

S={/l,/2,-..} 

by assigning each vector in S to a natural number. In general some of 
the vectors in S might be linearly dependent. We can build from S 
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another set S 0 of linearly independent vectors spanning the same 
subspace (S'), i.e., such that (S 0 ) — (S'), by the following procedure 
(which should be applied consecutively on n — 1, 2,...): if/ n is the zero 
vector or is linearly dependent on^ ,...,/ n _i, then discard it; otherwise 
include it in S 0 . Thus we get a set S 0 of linearly independent vectors 

So = tel, £2 (So) = (S). 

We can obtain from S 0 an orthonormal set T such that (T) — ( S 0 ) 
by the so-called Schmidt (or Gram-Schmidt) orthonormalization procedure. 
Since g x ^ 0, we can introduce the vector 


which is normalized. Proceeding by induction, assume that we have 
obtained the orthonormal system of vectors e 1 e n _ x . Then e n is 
given by 

^ gn (&n —1 1 gn) ^n —1 *** ^1 I gn") g l 

W ~ I \gn~ On-1 I gn) ^n—1 - -Oil gn> *1 || ’ 

The above vector is certainly well defined, since the denominator of the 
above expression is different from zero; namely, if it were zero, then we 
would have 

gn On—i | gw) € n —i *** Ol | gn) ^1 — 

i.e., gn would depend on e x e n _ 1 . However, by solving the equations 
for e 1 e n _ 1 , it is easy to see that we have 


gi — c i,i e i 


gn—1 c n— 1,1^1 “f* ^w-1,2^2 *** c n—l,n—l^n —1 , 

and therefore if g n depended on e x e n _ 1 , then it would also depend 
on gi,—,gn-i> contrary to the fact that S 0 consists only of linearly 
independent vectors. 

The vectors of T are obviously normalized. In order to prove that T 
is an orthonormal system, assume that we have proved that Oi | e)} = S^ 
for i , j — 1,..., n — 1. Then we have for m < n 

1 / \ 

Ora I &n) it / ” | \ [T (Ora I griy ^ Ofc I gn) * ^km ) 0, 

II gn ~ *** ~ \ e l I gl) e l II x 1 

which proves that (e { | = 8^ for i 9 j = 1,..., n. Thus, by induction T 

is orthonormal. 
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As we have for any n that e 1 e n can be expressed in terms of 
Si >•• • > Sn > an d vice versa, we can conclude that ( T) = (S 0 ). Q.E.D. 

2.4. Isomorphism of Euclidean Spaces 

We introduce now a concept of isomorphism of Euclidean spaces, 
which makes two isomorphic Euclidean spaces identical from the 
point of view of their vector structure as well as from the point of view of 
the structure induced by the inner product. 

Definition 2*4* Two Euclidean spaces and S 2 with inner products 
<• | •>! and <• | *> 2 , respectively, are isomorphic (or unitarily equivalent) 
if there is a mapping of onto $<> 

such that if for any A the vector f 2 e S 2 is the image oi A and the 

vector g 2 e S 2 is the image of g 1 , then 

fi +£1^/2 +g 2 , 

a fi ^ fl/a» a G C 1 , 

C/i I £1)1 — (f 2 \ £2)2 • 

A mapping having the above properties is called a unitary transformation 
of onto S 2 . 

Theorem 2*5* All complex Euclidean w-dimensional spaces are 
isomorphic to l\n ), and consequently (see Exercise 2.8) mutually 
isomorphic. 

Proof. If $ is an w-dimensional Euclidean space, there is according 
to Theorem 1.2 a set of n vectors/ 2 ,...,/ n spanning S. According to 
Theorem 2.4, we can find an orthonormal system of n vectors e 1 ,..., e n 
which also spans $. It is easy to check (see Exercise 2.7) that the mapping 

(2.5) /<-> | j j, = <«i | a n = ( e n |/>, 

provides an isomorphism between $ and l 2 (n). Q.E.D. 

Obviously, a similar theorem can be proved for real Euclidean spaces. 

Theorem 2*6* A unitary transformation 

A ^ A » A e ^1 ♦ /a e » 


( 2 . 6 ) 
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of the Euclidean space onto the Euclidean space <o 2 has a unique 
inverse mapping which is a unitary transformation of S 2 onto . 

Proof. We note that since 

Wfi-gi 111 = 11/.-ft II., 

the images / 2 and g 2 of and g x , respectively, are distinct whenever 
f 1 ^g 1 . Since the unitary map of is onto <o 2 , we conclude that the 
inverse of the mapping (2.6) exists. 

We leave to the reader the details of the remainder of the proof. 

Exercises 

2*1 ♦ Show that for a finite interval I 

<f\g> = jf*(x)g(x)dx 

is an inner product on the vector space 

2*2* Show that the vector space ^( 2 )(^ 1 ) introduced in Section 2.1 is 
a subspace of the vector space 

2*3* Prove that (2.2) is an inner product in ^^((R 1 ). 

2A. Show that 

\<f\g>\ 2 =<f\fXg\g>, 

\\f+g\\ = \\f\\ + \\g\\, 

if and only if either / is a multiple of g , i.e., if / = ag, a e C 1 , or g = 0, 
and if in addition a ^ 0 in case of the second relation. 

2*5* Show that if T is an orthonormal system of vectors, then all the 
vectors in T are necessarily linearly independent. 

2*6* Prove that a subspace of a Euclidean space is also a Euclidean 
space. 

2*7* Show that the mapping (2.5) is a mapping of $ onto l 2 (n), and 
that it satisfies the requirements of isomorphism given in Definition 2.4. 

2*8* Show that the relation of isomorphism of inner-product spaces 
is an equivalence relation, i.e., it is (see Exercise 1.6) reflexive, symmetric, 
and transitive. 
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3* Metric Spaces 

3.1. Convergence in Metric Spaces 

In an n-dimensional Euclidean space $ we can always find, due to 
Theorems 1.2 and 2.4, a basis of n vectors e x ,..., e n which constitute an 
orthonormal system. We can then expand any vector/of $ in that basis 

(3-1) f=f j a k e k . 

k =1 


We easily see that a k ™ (e k \ /). 

In an infinite-dimensional Euclidean space not every vector can be 
expanded in general in terms of a finite number of vectors. We can hope, 
however, to replace (3.1) with the formula 

00 

f y , tijc&ic > 

k=1 

but then we meet with the problem of giving a precise meaning to the 
convergence of the above series. This problem is solved in its most 
general form in topology, but for our purposes it will be sufficient to 
solve it within the context of metric spaces. 

Definition 3*1 ♦ If S is a given set, a function ? 7 ) on 5 X S 

is a metric (or distance function) if it fulfills the following requirements 
for any £, 77 , £ e S: 

( 1 ) d(g, 7 ]) > 0 if | i=- 7], 

(2) d(f,£) = 0, 

(3) d(£,ri) = d{ri,€), 

(4) <*(£,£) ^ 7]) -f- d( 77, £) (triangle inequality). 

A set S on which a metric is defined is called a metric space . 

A metric space does not have to be a linear space. For instance, a 
bounded open domain in the plane becomes a metric space if the metric 
is taken to be the distance between each pair of points belonging to that 
domain; such a domain obviously is not closed under the operations of 
adding vectors in the plane, but it provides a metric space. 

Generalizing from the case of one-, two-, or three-dimensional real 
Euclidean spaces, we introduce the following notions. 

Definition 3*2* An infinite sequence % x , £ 2 ,... in a metric space 
is said to converge to the point £ e M if for any c > 0 there is a positive 
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number N(e) such that £ n ) < € for all n > N(e). An infinite sequence 

^ is called a Cauchy sequence (or a fundamental sequence) if for any 

e > 0 there is a positive number M(e) such that d(£ m , £ n ) < e for all 
m,n > M(e). 

Theorem 3If a sequence £i, £ 2 >— m a metric space con¬ 
verges to some £ e then its limit £ is unique, and the sequence is 
a Cauchy sequence. 

Proof. If ^ converges to £ e Jt and to rj e */#, then by defini¬ 

tion, for any e > 0 there are N^e) and A^e) such that d{ £, £ n ) < € for 
n > A/^e) and d(rj, £ n ) < e for n > Af 2 (e). Consequently, for 
n > max(A 1 (e), N 2 (e)) we get by applying the triangle inequality of 
Definition 3.1, Point 4, 

d(L rj) < $ n ) + d($ n , 77) < 2e. 

As e > 0 can be chosen arbitrarily small, we get d(£, rj) = 0, which, 
according to Definition 3.1, can be true only if £ = rj. 

Similarly we get 

d($m^n)<d($ m ^)+d(t $ n )<* 

if m , n > N^ejl)] i.e., the sequence , f 2 ,... is a ls° a Cauchy sequence. 

Q.E.D. 


3.2. Complete Metric Spaces 

In case of sequences of real numbers, every Cauchy sequence is 
convergent, i.e., the set R 1 of all real numbers is complete. We state 
this generally in Definition 3.3. 

Definition 33* A metric space Jt is complete if every Cauchy 
sequence converges to an element of Jt. 

Not every metric space is complete, as exemplified by the set 91 of all 
rational numbers with the metric dfaj^ , m 2 \nf) = | m Y \n x — m 2 jn 2 |, 
which is incomplete. However, we know that the set 91 is everywhere 
dense in the set IR 1 ; we state this generally as follows: 

Definition 3*4* A subset S of a metric space Jt is {everywhere) dense 
in Jt if for any given e > 0 and any Jt there is an element rj belong¬ 
ing to S for which d(£, rj) < e. 

We can reexpress the above definition after introducing a few topo¬ 
logical concepts, generalized from the case of sets in one, two, or three 
real dimensions. 
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Definition 3*5* If £ is an element of a metric space Jt, then the set 
of all points 77 satisfying the inequality d(£, 77 ) < € for some e < 0 is 
called the e neighborhood of £, If S' is a subset of */#, a point £ e is 
called an accumulation (or cluster or limit) point of 5 if every e neigh- 
borhood of £ contains a point of S. The set S consisting of all the cluster 
points of S is called the closure of S. Obviously always S C S; if S — S 
then S is called a closed set. 

We say that the subset S of a metric space is (1 everywhere) dense in Ji 

if and only if Ji is its closure, i.e., if and only if S — Ji. 

The procedure of completing the set of rational numbers by embed¬ 
ding it in the set of all real numbers can be generalized. 

Definition 3*6* A metric space ,/ffl is said to be densely embedded 
in the metric space Jt if there is an isometric mapping of Ji into Ji ’, 
and if the image set Ji* of Ji in Jt is everywhere dense in Ji . 

A one-to-one mapping £ «-> | of a metric space Ji into another metric 
space Ji is called isometric if it preserves distances, i.e., if ^i(|, 77 ) — 
d 2 (£, rj) for 1,77 e <Ji and £, 77 e Ji whenever £ <-> £ and 77 <-» 77 . 

3.3. Completion of a Metric Space 

^Theorem 3*2* Every incomplete metric space Ji can be embedded 
in a complete metric space Ji , called the completion of Ji. 

The proof of this theorem can be given by generalizing Cantor’s 
construction, by which one builds the set of real numbers from the 
rational numbers. 

Denote by Ji ^ the family of all Cauchy sequences in M. If £' = 
{£/, £ 29 --} and £" = {£i\ are two such sequences, we say that 

they are equivalent if and only if 

(3.2) lirn d{£ n \ £ n ") = 0. 

00 

It is easy to see that we have thus introduced an equivalence relation 
inJ 8 (see Exercise 3.1) if we recall (see Exercises 1 .6 and 2,8) the general 
definition of an equivalence relation. 

Definition 3*7* A relation £ ~ 77 holding between any two ordered 
elements of a set S is called an equivalence relation if it is 

(1) reflexive: £ ~ £ for all feS; 

( 2 ) symmetric: £ ~ rj implies that 77 ~ £; 

(3) transitive: £ ~ r; and 77 ~ £ implies that £ ~ £. 
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A subset X of S having the property that all the elements of X are 
equivalent and that if 77 ^ £ and £ e X then 77 e X is called an equivalence 
class (with respect to the equivalence relation ^). 

We denote the family of all equivalence classes in Ji s [with respect to 
the equivalence relation given by ( 3 . 2 )] by the symbol Ji, and agree to 
denote the equivalence class containing the Cauchy sequence | also by 
Consequently if e Ji, then if and only if the Cauchy 

sequences e*# 8 are equivalent, i.e., satisfy (3.2). 

We introduce the real function d B (£, rj) on Ji s X Ji s by defining 
for 1 = 1 ^, £ 2 and rj = , V2 

(3.3) <4(1, rj) = lim d(( n , rj„). 

n-*oo 

In order to see that the above limit exists for any rj e Ji B we employ 
the relation (see Exercise 3.2) 

(3*4) | d(J^ m , 1 7 m ) d(J^ n , rj n )\ ^ d(£ m , £ w ) -|- d(rj m , 7^ w ) 

to show that ^(^ , r; x ), d (£ 2 , ^ 2 )v is a Cauchy sequence of numbers, 
and therefore has a limit; namely as ^ , £ 2 ,... and 771 , r) 2 ,... are Cauchy 
sequences, we can make rf(£ m , £ n ) < e if m,n > N 1 (e) f and d(rj m , y n ) < e 
if m, n > N 2 (<e), which, used in conjunction with (3.4), proves the state¬ 
ment. 

We can show that d B (£, rj) also defines a real function on Ji X Ji by 
establishing that d B (i', rj') — d B (g", rj") if — l" and rj' = rj" for 

&Ji- We first obtain that d B (£', rj') = d B (£", rj') from the 
inequality (see Exercise 3.3) 

<*(£»', (n) 

because d(£ n ', £ n ") —> 0 as n —> 00 due to the fact that the Cauchy 
sequences and £" belong to the same equivalence class. Similarly we 
can show that d B (£", rj') = d B (£", rj"), and thus prove that d B (%', rj') = 

It is easy to check that the function d B (£, rj) defines a metric on Ji 
(see Exercise 3.4). We show now that the ensuing metric space, which 
we denote also by Ji , is complete. 

Assume that | (1) , £ (2) ,... is a Cauchy sequence in Ji , where | (fc) is 
the equivalence class containing the Cauchy sequence {£i fc) , °f 

elements of Ji. Choose for each integer k an element 77 /c = i; (k) 6 Ji 
such that d(£ {k) , rj k ) =-- d(^ k) , % {k) ) < 1 //? for all m greater than some N k \ 
this is certainly possible because is a Cauchy sequence in M, 
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Consider now the elements rj k = , r) k and #<*> = {£%>, 

of Ji B . We obviously have 

ds(t\v k ) = 

If we let in the above relation m -> oo, then we find that d^ (k \ —► 
ds(£ {k \ rj k ) since lim £ {k) ) = 0 as w -> oo (see Exercises 3.5 and 3.6) 

and consequently 

ds(i {k \ rj k ) < 1/A. 

We can now deduce that rj = {^71 , ^7 2 >-•-} * s a Cauchy sequence in di 
by writing 


(3*5) d(yj m , rj n ) ds(jj m 5 ^n) 

< d s (im , f (m) ) + *(£<"*>, f *>) + <4(f 

Since £ (1) , £ (2) ,.«. is a Cauchy sequence in*/#, we can make d B (£ {m) , £ in) ), 
and consequently the entire right-hand side of (3.5) arbitrarily small for 
all sufficiently large m and n. Thus, rj e Ji B . 

We can establish that the equivalence class rj e Ji containing the 
Cauchy sequence rj = , rj 2 is the limit of £ (1) , £ (2) ,... if we write 

(3.6) d s (rj, f<*)) < d s (rj, rj k ) + d s (rj k , £<*>). 

The right-hand side of (3.6) can be made arbitrarily small for sufficiently 
large k because d B (rj k , £ {k) ) ^ 1/A and lim^oo d B (rj y rj k ) — 0 (see Exercise 
3.6). 

In order to finish the proof of the theorem, we have to embed dt into 
the complete metric space Ji. To that purpose we map £ e M into the 
equivalence class £ containing the sequence {£, This mapping is 

obviously one-to-one and isometric, as d(£, rj) = d B ( £, rj). Furthermore, 
the image Jf of di in Ji is everywhere dense in Ji\ namely if rj e Ji 
contains {rj 1 ,rj 2 ,...} eji B , then for arbitrary e > 0 we can choose an 
rj k in dt' containing {r] k , rj k and such that d B (rj y rj k ) < e. 

Exercises 

3 J ♦ Show that the relation £ ~ rj between any two Cauchy sequences 
£ = , £2 an( l V ™ {Vj , V 2 >•••} °f a metric space Ji- y defined to 

mean that d(£ n , rj n ) ~~ 0, satisfies the three requirements given 

in Definition 3.7 for an equivalence relation. 
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3*2* Prove that any four elements £ 1 , £ 2 > > V 2 °f a metric space dt 

satisfy the relation 

I d (ti, £2) — d (v 1 ,Vz)\ < <*(£i, ’h) + ^(£2 > %)• 

3.3. Prove that if £, rj, 4 are elements of a metric space,///, then 

I -<*(£, 01 < <%, 0- 

3.4. Show that the function d s (£, rj) defined on Jt X by (3.3) 
satisfies the four requirements for a metric (those requirements are 
formulated in Definition 3.1). 

3*5* If in a metric space di the sequence ^ , £ 2 ,... converges to £, 
prove that for any rj e df, lim n _,oo d(£ n , rj) = d(£, 37 ). 

3*6* If £ is the equivalence class of M (introduced in Theorem 3.2) 
containing the Cauchy sequence {£ x , £ 2 ,...} eJ ? s , then for any e > 0 
there is an N(e) such that d s (£, £ k ) < e for all k > N(e), where £ k = 
{£ k , £ k ,...}. Prove this statement! 

3*7* Show that if S 1 is an everywhere dense subset of a metric 
space dt y and S 2 is an everywhere dense subset of , then S 2 is every¬ 
where dense in di. 


4* Hilbert Space 

4 . 1 . Completion of a Euclidean Space 

It is easy to establish (see Exercise 4.1) that in normed space 

(4.1) d(f,g)=\\f-g\\ 

is a metric. Therefore, we can define in JT convergence, completeness, 
etc. in the metric (4.1), which is then called convergence , completeness , etc. 
in the norm. A complete normed space bears the name of Banach space . 

The above concepts can also be applied to Euclidean spaces, because 
according to Theorem 2.3 we can introduce in such spaces a norm, and 
therefore also a metric. A Euclidean space which is complete in the 
norm* is called a Hilbert space. 

Not every Euclidean space is a Hilbert space. For instance, the 

# The concept of completeness can be defined and considered for other topologies 
besides the norm topology. 
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Euclidean space introduced in §2 is not complete. To see this, 

note that the sequence ,/ 2 of continuous functions 

( 42 ) fM-l 1 ^ 1*1 

{ > Jn( ) (exp[—n 2 (| x | - a) 2 ] for \x \ > a, 

is a Cauchy sequence in ^^(R 1 ) ( see Exercise 4.2) but it does not con¬ 
verge to an element of ^ 2 )(^ 1 )- In fact, it is easy to establish that with 
increasing n, the functions in the above sequence approximate arbitrarily 
closely in norm the discontinuous step function 

y(x )=\ l ^ 1*1 
n > (0 for | x | > a. 


which, however, does not belong to ^^((R 1 ). 

Definition 4*1* We say that the Euclidean space S can be densely 
embedded ih the Hilbert space Jf 7 if there is a one-to-one mapping of S 
into e ^, such that the image S' of S is everywhere dense in Jf 7 , and the 
mapping represents an isomorphism between the Euclidean spaces S 
and S'. 

Theorem 4*1 ♦ Any incomplete Euclidean space S can be densely 
embedded in a Hilbert space. 

Proof. Denote by S the complete metric space built from the set S B 
of Cauchy sequences in S according to the procedure used in proving 
Theorem 3.2. Define in S B the operations 

^4 3) / + I = {/i + £i j /2 +£2 >•••}> 

= Wi, >•••} 

for any two sequences /= {f x ,/ 2 | = , g 2 from / s . It is 

easy to check that the above operations are operations of vector addition 
and multiplication by scalar. Furthermore, if /'=/", where /' = 
{/i'» fz,-} and/" =J/i",/ 2 ",-}» i- e > if /' and /" belong to the same 
equivalence class in S and therefore 

Hm || || = lim d(f n ',f n ") = 0 , 


then || af n ' — af n " || = | a \ || /„' —|| -> 0 ; thus, we also have that 
r+s=r + g and af' = af". Consequently, (4.3) defines vector 
operations on S 9 which thus becomes a vector space. 

We now introduce the complex function on S B X $ B defined by 

(4.4) </ll>8= lim </ w | *„>. 
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The limit in (4.4) exists because </i | gf), is a Cauchy 

sequence of numbers, as can be seen from the following inequality: 

K/m I gm) Cf n I gii)\ ^ K/m fn I gm) Cf n I gm gn}\ 

^ II fm fn II II gm II + II fn II II gm ~ gn II*, 

namely, \\f n || -► d s (f, 0) = H/|| s and || g n || -> || g IJ S for n oo, where 

II/IIb = V</|/>«, 

while ||/ w — / n || and \\g m ~~g n \\ can be made arbitrarily small for 
sufficiently large m and n. 

Furthermore, if /' = f" e «f s , then </' | |> s = </" | |> s , as can be 
concluded from the inequality 

Kfn I gn> ~ <fn' I gn >I < II fn ~ fn" II II gn II- 

Hence, </1 g) B is a uniquely defined function on S X This function 
determines an inner product on £ (see Exercise 4.4). It is obvious that 
the mapping/*-*/ — {/,/,.••} of £ into £ has an image £’ which is a 
linear subspace of £\ according to the construction, £' is everywhere 
dense in £ and the above mapping provides an isomorphism between $ 
and<^\ Q.E.D. 

A similar theorem can be proved for normed spaces (see Exercise 4.5). 
4.2. Separable Hilbert Spaces 

In quantum mechanics we deal at present almost exclusively with a 
special class of Hilbert spaces which are called separable. 

Definition 4*2* The Euclidean space £ is called separable if there 
is a countable everywhere dense subset of vectors of £. 

In the early days of research on Hilbert spaces, separability was taken 
to be an integral part of the concept of a Hilbert space. 

In quantum mechanics we are concerned primarily with separable 
complex Hilbert spaces. We shall agree that in the future whenever we 
refer to a space as a Hilbert space we mean a complex Hilbert space, 
except if otherwise stated. 

Theorem 4*2* Every subspace of a separable Euclidean space is a 
separable Euclidean space. 

Proof. The fact that a subspacc £ x of a Euclidean space £ is also a 
Euclidean space is easy to check (see Exercise 2.6). In order to establish 
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the separability of S x , construct a countable subset S — {g xl , g m , 
£21 ,&»•••} of <?! in the following way. 

Let R — {f x ,/ 2 be a countable everywhere dense subset of 
there has to be such a set because of the separability of $. Let g mn denote 
a vector of S x satisfying \\g mn — f n || < 1 /m, in case there is at least one 
such vector, or the zero vector in case there is no vector of S x in the 1 jm 
neighborhood of/ n . 

The set S is everywhere dense in S x because for any given h e (o x and 
m > 0 we can find an/ n e R such that || h — f n || < 1/m. Thus, by the 
above rule of constructing S we certainly have g mn 0 and therefore 

il h - g mn || < || h - f n || + ||/ w - g nm || < 2/m. 

This proves that 5 is everywhere dense in & x . Q.E.D. 

4.3. / 2 Spaces as Examples of Separable Hilbert Spaces 

As an important example of an infinite-dimensional separable Hilbert 
space we give the space / 2 ( 00 ), which is basic in matrix mechanics. 

Theorem 43♦ The set / 2 ( 00 ) of all one-column complex matrices a: 
with a countable number of elements 


a 


a x 

#2 


for which 


(4.5) 


Li 


1 


a Jc 


I 2 < +00 


becomes a separable Hilbert space, denoted by Z 2 ( 00 ), if the vector opera¬ 
tions are defined by 

l a \ + ^i\ 

(4.6) oc + /3 = I a 2 + b 2 1, 


(4.7) 


aoc = 


dd x 

aa 2 


a e C 1 , 


and the inner product by 
(4.8) <a 


w 

/3> = L a k* b k • 
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Proof. The operation (4.7) maps C 1 X Z 2 (oo) into Z 2 (oo) because 
EkLi I aa k | 2 = | a | 2 Z £ =1 I a k | a < +oo if (4.5) is satisfied. 

In order to see that (4.6) maps Z 2 (oo) X Z 2 (oo) into Z 2 (oo), apply the 
triangle inequality on the v-dimensional space Z 2 (v), v < + oo, in order 
to obtain 

v ”i 1/2 r v 11/2 r i- 

Z I a * + b * I 2 < Z I a * I 2 + Z I h l 2 

&=i J Lft-i J L k=i 

The above inequality shows that when v —> oo, the left-hand side 
converges if a, j3 e Z 2 ( oo). 

Similarly, we prove that (4.8) converges absolutely by applying the 
Schwarz-Cauchy inequality on the v-dimensional space Z 2 (v), v < +oo, 
in order to obtain 

v v r v *] 1/2 r v ii/2 

Z K***I = Z i«*ii*»i < Z i «*= l 2 Z I **= l 2 • 

*=i &=i L&=i J L/c-i J 



We leave as an exercise for the reader (see Exercise 4.6) to check that 
we deal indeed with a Euclidean space. 

To prove that this Euclidean space is complete, assume that a (1) , a (2) ,... 
is a Cauchy sequence, where 


a (n) = \ a (n) 1 


As we obviously have that for any k = 1, 2,... 


a™ | < || a (m) 


we deduce that for fixed k the sequence is a Cauchy sequence 

of complex numbers; hence, this sequence has a limit b k . We shall prove 
that the one-column infinite matrix 

( b i\ 


is an element of Z 2 (oo), and that a (1) , oc (2) ,... converges in the norm to j3. 

By applying again the triangle inequality on the v-dimensional space 
Z 2 (v), v < +°o, we obtain 
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the above inequality is true for any m = 1, 2,... . As a (1) , a (2) ,... is a 
Cauchy sequence, there is for any given e > 0 an N 0 (e) such that for any 
m,n> N 0 (e) and any positive integer v 

Y | a — a<. w) | 2 < || oc {m) — oc < n > || 2 < € 2 /4. 

fc=i 

On the other hand, as = lim^*, we can find for fixed v an 
N v (e) such that | b k — <4 W) | < € /2 (fc+1) / 2 for any w > N v (e) and all 
k = 1, 2,..., v. Thus, we get from (4.9) that for all n > AT 0 (e) and all 
positive integers v 

<4-10) [f l K - « (il^)+ § 


< 



e 



As the right-hand side of the above inequality is independent of v and 
the inequality itself is true for any n > we can let v —> oo in 

(4.10) to derive 


(4.11) 


i\K-< n) i 2 

Lfc=i J 


1/2 




for all n > N 0 (e). 


By returning again to / 2 (v), v < -f °o, to obtain 

' V 11/2 r V 1 1/2 r V 1 1/2 

ei*j 2 < + zi^’i 2 

L/c-i J L/c-i J L/c=i J 


we establish that j3 e / 2 (oo) by letting v -> oo. The relation (4.11) tells us 
now that a (1) , a (2) ,.-- converges to 

Finally, in order to prove the separability of / 2 (oo), consider the set D 
of all the one-column matrices a from / 2 (+oo) with kth components 
(oc) k = a k , where a k has rational numbers as its real and imaginary part, 
i.e., Re a k , Im a k e 5 R, k = 1, 2,..., and in addition 

(4.12) a n+1 = a n+2 = — = 0 

for some integer n. The set D is countable (see Exercise 4.7). In order 
to prove that D is everywhere dense in / a (oo), take any one-column 
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matrix y e Z 2 (-foo) with Mi component (y) k = c k . As y e Z 2 ( oo), there 
is for any given e > 0 an integer n such that 

Z k*l 2 <* 2 /2. 

fc=w+l 

Furthermore, as the set 91 of rational numbers is dense in the set U 1 of 
real numbers, we can choose an a e Z) which satisfies (4.12) and is such 
that | c k — a k | < el'\/2n for all k — 1,..., n. Thus, we have 

r n oo li/2 

ll y ~ “ll = Z l c k — a * l 2 + Z l I 2 1 < e > 

Lfc=l k=n +1 J 

which proves that Z 2 (oo) is separable. Q.E.D. 

4.4. Orthonormal Bases in Hilbert Space 

In an infinite-dimensional Euclidean space it is important to distin¬ 
guish between the vector space ( S ) spanned by a set S , and the closed 
vector space [S] spanned by S . 

Definition 43♦ The vector space (or linear manifold) (S) spanned by 
the subset S of a Euclidean space $ is the smallest* subspace of S’ con¬ 
taining S . The closed vector subspace [S] spanned by S is the smallest 
closed vector subspace of $ containing S'. 

In the finite-dimensional case ( S) — [S] because all finite-dimensional 
Euclidean spaces are closed (see Exercise 4.8). That this is not so in the 
infinite-dimensional case can be deduced from the following theorem, 
whose simple proof we leave to the reader (see Exercise 4.9). 

Theorem 4*4* The subspace ( S) of the Euclidean space $ spanned 
by the set S is identical with the set of all finite linear combinations 
a ifi + ••• + & n fn °f vectors from S, i.e., in customary set-theoretical 
notation, 

(S) = *** a n f n : 9 *** 9 f n £ S , a~i ,..., a n 6 C 1 , n = 1, 2,...}. 

The closed linear subspace [S] spanned by S is identical to the closure ( S) 

oi(S). 

Definition 4*4* An orthonormal system S of vectors in a Euclidean 
space is called an orthonormal basis (or a complete orthonormal system) 
in the Euclidean space $ if the closed linear space [ S ] spanned by S is 
identical to the entire Euclidean space, i.e., [S] = &. 

# That is, if i r is a subspace of $ and S C if % then necessarily (S) C 'f\ 
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It must be realized that an orthonormal basis T in an infinite-dimen¬ 
sional Euclidean space S is not a vector basis for the vector space S, i.e., 
(T) is in general different from S . For instance, this is so with the basis 
e 1 , e 2 >••• °f / 2 (oo), where e m is the vector whose nth matrix component is 

m)n ^mn * 

Theorem 4.5. A Euclidean space S is separable if and only if 
there is a countable orthonormal basis in S. 

Proof, (a) To prove that in a separable Euclidean space S there is a 
countable orthonormal basis, note that due to the separability of S 
there is a countable set S — {/i ,/ 2 >•••} which is everywhere dense in S . 
According to Theorem 2.4, there is a countable orthonormal system 
T = {e x , e 2 such that ( S) = (T). Due to Theorem 4.4 we have then 

[T] = (T) = (S) = [5] = S. 

(b) Conversely, to show that if there is a countable orthonormal basis 
T = {^i , e 2 >•••}> then S is separable, consider the set 

R = {r^! 4- ••• + r n e n : Re r x ,..., Re r n , Im r x ,..., Im r n e9t, n = 1, 2,...} 

which is countable, as can be established by using the technique for 
solving Exercise 4.7. The set R is also everywhere dense in S ; namely, 
if /e S and e > 0 is given, then as \e x , e 2 ,...] — S, there is a vector 
g = a x e x + *“ + a n e n such that 

11/ ~ ~ **• — a n e n || < e/2. 

Furthermore, we can choose complex numbers r x ,..., r n with rational 
real and imaginary parts so that 

I r k - a k | < e/2 Vn, k = 1,..., n. 

Thus, we have that for h = r x e x + * “ + r n e n e R 

\\f-h\\^\\f-g\\+\\g-h\\< € - + € - = e. Q.E.D. 

There are a few very important criteria by means of which we can 
establish whether an orthonormal system S' is a basis in a Euclidean space. 

Theorem 4.6. Each of the following statements is a sufficient and 
necessary condition that the countable orthonormal system T = {e l9 e 2 ,...} 
is a basis in the separable Hilbert space* 

* The theorem applies to the finite-dimensional as well as the infinite-dimensional case, 
though it is stated and proved here for infinite-dimensional Jf. In the finite-dimensional 
case, oo should be replaced by the dimension of 
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(a) The only vector / satisfying the relations 

(4.13) <e fc |/>=0, * = 1,2,... 

is the zero vector, i.e., (4.13) implies/ = 0 . 

(b) For any vector /e 

(4-14) lira II/- f< efc |/> e JU0, 

”"+°° I' *_i II 

or symbolically written 


/ = X <«* l/> e k , 

fc =1 

where < e k |/> is sometimes called the Fourier coefficient of/. 

(c) Any two vectors satisfy ParsevaVs relation : 

(4.15) </k> = £</1 

fc=l 

(d) For any/e < 2 ^ 

(4.16) II/II 2 = E K«»l/>l* 

fc —1 

We start by proving that the criteria (a) and (b) are equivalent to 
the requirement that T — {e 1 , e 2 is an orthonormal basis, as that 
requirement was formulated in Definition 4.4. To do that we shall 
prove that (a) implies (b), (b) implies “T is a basis” (as formulated in 
Definition 4.4) and “T is a basis” implies (a). 

In order to show that (a) implies (b) we need the following lemma. 

Lemma 4.1. For any given vector / of a Euclidean space S (not 
necessarily separable) and any countable orthonormal system {e x , e 2 ,...} 
in S, the sequence f x ,/ 2 ,... of vectors 

(4-17) /. = t <e k | /> 

fc -1 

is a Cauchy sequence, and the Fourier coefficients < e k |/> satisfy Bessel’s 
inequality 

n/„ii 2 = i i<«* i/>i a < ii/ii a . 

fc»i 


(4.18) 
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Proof. Write 

K = /-/», 

where f n is given by (4.17). We have 

<fn \h n y=o 

because | h n ) — 0 for i = 1, 2 ,..., n , 

< e i I K> =(e\\ / - X < e i l/> 

= <«< I /> - E i/x«i l «y> = o, 

i-i 

as <Cf | Cj> = 8 iy . Thus 

</ l/> = <fn + K | /. + /*„> = </„ |/ M > + <A„ | A m >, 

and consequently, since < h n \ h v ') ^ 0, 

(4-19) </, I/„> A </ |/>. 

By using (4.17) and (c i | e.j') = h i} we derive 

■ IIAII 2 = <y n I A> = E <«i !/>*<«< I l/> 

i,3=l 

= E l<«*l/>l a , 

i-1 

which shows in conjunction with (4.19) that Bessel’s inequality (4.18) 
is true. From (4.18) we can deduce that 

(4.20) £ l<A l/>l 2 < ll/ll 2 < +oo, 

i=l 

i.e., the above series with nonnegative terms is bounded and therefore 
it converges. Since we know that for m > n 

m 

Wfm - All 8 = E K«<l/>l 2 . 

«-«•( 1 


(4.21) 
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we easily see from (4.20) and (4.21) that/! ,/ 2 ,... is a Cauchy sequence. 

We return now to proving Theorem 4.6. If T = {e 1 , e 2 ,...} is a 
countable orthonormal system in the Hilbert space 34?, then according 
to Lemma 4.1 for any given fe 34? the sequence ,/ 2 ,..., where 

fn = Z < e * I />«* 

fc= 1 

is a Cauchy sequence. Since Jf 7 is complete, this sequence has a limit 
g g 34?. 

We can now show that if the statement (a) about T in Theorem 4.6 
is true, then (b) is also true, due to the fact that (a) implies / = g; 
namely, for any k = 1 , 2 ,... we have 

</ ~ g I = lim </ —/ w I «*> 

O 


= </ I «jb> “ E <«< !/>*<«< I «*> = 0- 

n-»+°o r-i 
t=l 

Thus, if (a) is true, we must have / — g = 0. 

It is obvious that statement (b) implies that T = {e 1 , e 2 ,...} is a basis, 
because according to Theorem 4.4 any fe 34? is the limit of elements 
/j ,/ 2 ,... from the linear space (T) spanned by T, where f n e (T) is of 
the form (4.17). 

We show now that the fact that T = {e 1 , e 2 ,...} is an orthonormal 
basis implies that (a) is true. Assume that some/e 34? is orthogonal on 
the system {e 1 , e 2 ,...}. Since /e , e 2 ,...] = 34?, there is a sequence 
gi > £2 >••• G (* 1 , e 2 >•••)> i.e., for some integer s n 


*n 

gn ~ ^ &k e k 5 
fc=l 

which converges to/. Consequently, as </ 1 e k y = 0, 


</!/> = lim </ I £n> = Hm X | = 0, 

7Z->co W->00 


and therefore / = 0 . 

We shall demonstrate that statement (c) is equivalent to (a) or (b) 
by showing that (b) implies (c), and (c) implies (a), and thus finish the 
proof of Theorem 4.6. 

If (b) is true, then we have (see Exercise 4.10) 

</ I g> =* lim </ n I gn>f 


(4.22) 
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where 

n n 

fn Z Xk | fy &k j gn Z Xk I * 

fc=l fc=1 


From the relation 


</n I £n> = Z <*< l/>*<^ I e iX e i \g> ■= Z </I *tX«f I £> 

Li=l i=l 

we immediately obtain ParsevaPs relation (4.15). 

If we assume (c) to be true, then (a) is also true, because if some vector/ 
is orthogonal on {e 1 , e 2 ,...}, i.e., </1 e k ) — 0 , k = 1 , 2 ,..., then by 
inserting/ — g in (4.15) we get 

</l/> = I </l «*><«» l/> =o, 


which implies that / — 0 . 

Finally, (d) follows from (c) by taking again in (4.15) that/ — g. 
Vice versa, if (d) is true then (a) has to be true, because if </1 e k y — 0 
for k — 1, 2,..., then we get from (4.16) that jj /|| 2 = 0, which implies 
that/— 0. Q.E.D. 

It is easy to see that, due to the fact that every Euclidean space can be 
embedded in a Hilbert space (Theorem 4.1), the criteria (b) (c), and (d) 
are also necessary and sufficient criteria for T to be an orthonormal basis 
in a Euclidean space in general, while (a) is necessary but not sufficient 
(see Exercise 4.13). 

4.5. Isomorphism of Separable Hilbert Spaces 

We can now demonstrate for infinite-dimensional Hilbert spaces a 
theorem analogous to the Theorem 2.5 for finite-dimensional Hilbert 
spaces. 

Theorem 4*7* All complex infinite-dimensional separable Hilbert 
spaces are isomorphic to / 2 (oo), and consequently mutually isomorphic. 

Proof. If is separable, there is, according to Theorem 4.5, an 
orthonormal countable basis {e 1 , e 2 ,...} in which is infinite when J#* 
is infinite dimensional. According to Theorem 4.6 we can write for any 
fe 

00 

/ — Z » C l6 “ X/C l/X 

/c -1 



42 


!♦ Basic Ideas of Hilbert Space Theory 


where by (4.16) 

£ \c u I 2 = ll/ll 2 < +oo. 
1 

Therefore 


oc f 


C1 \ 

C 2 ) e/ 2 (oo). 


Vice versa, if 


P 


h e/ 2 (oo), 


then/! ,/ 2 

n 

fn — Ya ^k e k 
fc -1 

is a Cauchy sequence, because for any m > n 

m 

li/ ro -/Ji 2 = E i^i 2 

fc=w+l 

and Z/Li I I 2 converges. Thus, due to the completeness of 3P 
f 1 ,/ 2 ,... converges to a vector f e and we have 

c,c = <e k | /> = lim (e k | £ b,e\ = b k . 

Therefore, the inverse mapping of the mapping / h-* oc f of into 
/ 2 (oo) exists, and has l\ oo) as its domain of definition. Hence the 
mapping f\-> oi f is a one-to-one mapping of onto / 2 (oo). It can be 
easily checked (see Exercise 4.11) that this mapping supplies an isomor¬ 
phism between ^ and Z 2 ( oo). Q.E.D. 

As we shall see later, the above theorem provides the basis of the 
equivalence of Heisenberg’s matrix formulation and Schroedinger’s wave 
formulation of quantum mechanics. 

Theorem 4*8* If the mapping 


is a unitary transformation of the separable Euclidean space into the 
Euclidean space «f', and if {e x , e 2 is an orthonormal basis in 



4* Hilbert Space 


43 


then {ef , e 2 f ,...} is an orthonormal basis in S ', where e n ' denotes the 
image of e n . 

Proof . Let S be infinite dimensional, and denote by <• | and 
<• | *>2 the inner products in S and S'' respectively. Then 

\ y 2 ~ | Bj) i , 

i.e., { e /, e 2 \...} is an orthonormal system in S'. Since each/' e S' has 
a unique inverse image f e S , we have 



«»' = Km 1 / - £ <«» l/>i = 0, 

2 M_>0 ° 11 4=1 1 


which by Theorem 4.6(b) proves that {ef , £ 2 V**} i s a basis. 

The Case when S is finite dimensional can be treated in a similar 
manner. Q.E.D. 


Exercises 

4*L Show that in a normed space JT the real function d(f,g) = 
||/ — g || on X Jf is a metric, i.e., it satisfies all the requirements of 
Definition 3.1. 

4*2* Prove that for any e > 0 there is an N(e) such that 

y +°° \l/2 

for m, n > N(e), where f n is given by (4.2). 

4*3* Check that the operations (4.3) satisfy the axioms in Defini¬ 
tion 1.1. 


4*4* Check that (4.4) satisfies the requirements of Definition 2.1. 

4*5* Show that if JV' is a normed space and JS is the completion of Jf 
in the norm, then: 

(a) is a linear space with respect to the operations 

/ + § = ifi + 5 i j/2 + £2 >•••}> 

of = Wi, a h »•••}; 

(b) the limit ||/|| s = lim^^oo || || exists for every Cauchy sequence 

{/1 ,/ 2 ,...} and defines a norm in 

(c) is a Banach space and the image JT' of Jf in JT defined by 
the mapping/<-> {/,/,...} is a linear subspace of which is everywhere 
dense in ./T. 
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4.6. Show that (4.6), (4.7), and (4.8) satisfy the axioms for vector 
addition, multiplication by a scalar, and inner product respectively. 

4.7. Show that the subset D of Z 2 ( oo) is countable, where D consists 
of all vectors a which have the properties: ( 1 ) a finite number of com¬ 
ponents ,..., a n (for some integer n = 1 , 2 ,...) of a are complex 
numbers with real and imaginary parts which are rational numbers; 
( 2 ) the rest of the components vanish. 

4.8. Show that every finite-dimensional Euclidean space is a separable 
Hilbert space. 

4.9. Prove Theorem 4.4. 

4.10. Show that if in a Euclidean space ,/ 2 ,... converges in norm 
to / and g !, g 2 to g , then </ | £> = lim^^ </„ | g n >. 

4.11. Show that the mapping/<-> of Jf 7 onto / 2 (oo) satisfies the 
requirements for an isomorphism, given in Definition 2.4. 

4.12. Prove that if one orthonormal system {e 1 , e 2 ,...} in a Euclidean 
space € satisfies either (4.14), or (4.15), or (4.16), for every vector/ (or, 
in case of (4.15), for any two vectors / and g) from <f, then {e x , e 2 ,...} 
is a basis in $. 

4.13. Verify that the criterion of Theorem 4.6(a) is not sufficient to 
insure that an orthonormal system {e 1 , e 2 ,...} in a Euclidean space $ 
satisfying that criterion is a basis by showing the following: 

Let { h x , h 2 ,...} be an orthonormal basis in a Hilbert space Jf 7 , and let & 
be the vector subspace spanned by (SfcLi (l/&)A fc ), A 2 , h z ,..., i.e., 
$ — (Xfc=i (1/&) h k , h 2 ,...); then & is a Euclidean space. Prove that: 

(a) — h 2 , e 2 — h% ,..., e n = h n+1 ,...} is not an orthonormal basis 
in S. 

(b) If/e $ is orthogonal to {e 1 , e 2 ,...} C <f, then/ = 0. 


5. Wave Mechanics of a Single Particle Moving in 
One Dimension 

5.1. The Formalism and Its (Partial) Physical Interpretation 

As an illustration of a physical application of the preceding results, we 
shall consider the case of a particle restricted to move in only one space 
dimension within a potential well. We denote the space-coordinate 
variable by x and the time variable by t. Assume that on our system there 
acts a force field F(x) which can be derived from a potential V(x) t i.e., 
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F(x) = —(djdx) V(x). In classical mechanics, if we denote the momentum 
of the particle by />, we have the following expression for the total energy 
£ of a particle of mass m: 

(5.1) E =p 2 12m + V(x). 

Classically the state of the particle is described by its trajectory x(t ), 
where at any moment t , x(t) e U 1 . 

As we mentioned in the Introduction, one of the postulates of quantum 
mechanics is that the state of a system is described by a function *F( t ), 
where W(t) is a vector in a Hilbert space. In the wave mechanics version 
of quantum mechanics, the state of a one-particle system is postulated 
to be described at time t by a “wave function” i/j(x> t) which is required 
to satisfy the condition 

r +oo 

(5.2) , \ifj(x,t)\ 2 dx = 1. 

J —co 

As a function of t , ifj(x, t) is assumed to be once continuously differen¬ 
tiable in t\ in addition we require for the present that i p(x, t) have a 
piecewise continuous second derivative in x. Thus, we can consider 
i fj(x, t) to be at any fixed time t an element of the Euclidean space 
(see Exercise 5.2) of all complex functions f(x) which vanish at infinity 
and are square integrable, i.e., 

/»+°° 

\f(x)\ 2 dx < +oo, 


as well as once continuously differentiable with lim^^oo/'^) — 0. 
In inner product is taken to be 

/»+°° 

<f\g>= f*(x)g(x)dx, 

J —CO 


and consequently we recognize (5.2) to be the normalization condition 

/*+°0 

ITOI 2 = | J>(x, t)\ 2 dx = 1, 

J —00 


where *F(t) e denotes the vector represented by the function 

/i(*) == 0(*» *)• 

As a dynamical law we have in classical mechanics an equation of 
motion derivable from Newton’s second law, which in the present case is 


dV 


d 2 x(t) 


(5.3) 
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In wave mechanics it is postulated that the wave function i/j(x , t) satisfies 
the Schroedinger equation* [we assume throughout that V(x) is piece- 
wise continuous], 


(5.4) 


dt 


h 2 

2m dx 2 


+ V(x)Mx, t). 


A heuristic recipe which leads to the above differential equation is the 
following: replace/) in (5.1) by the differential operator — ifi{d\dx ) and E 
by the differential operator iH(djdt ). If we deal with these operators 
formally and write (d/dx) 2 = d 2 \dx 2 , we get the operator relation 


ih 


d_ 

dt 


h 2 d 2 
2m dx 2 


+ V(x) 


which yields (5.4) when applied to ifj(x , t). 

We note the important fact that Schroedinger’s equation (5.4) preserves 
the normalization condition (5.2): 


d_ 

dt 


,+oo ,+00 , 

I >K.x, <)| 2 dx = ( 

j —00 J —00 ' 

__ h / 

2 im J _oo \ 


dif;*(x, t) 


dt 

+°° /d 2 i/>*(x, t) 
'dx 2 


tfj(x 9 1) + <A*(*, t) — ) dx 


#*> t) - t) dx 


h 

2im 


im Woo L ox dxl x== _ a 

f +0 ° / 36* 36 36* 36 \ \ 

J _ w \~dx~ ~dx ~ ~dx~ ~dx) dx ) = °' 


This is important in view of the correspondence rule proposed by 
Born [1926a, b], which constitutes the generally accepted physical inter¬ 
pretation of quantum mechanics. A very special case of this interpreta¬ 
tion, which will be stated in its most general form in Chapter IV, §1, is in 
the present case given by the rule that for any interval I in U 1 


p t(I) = f I 0(*, t )\ 2 dx 

J i 

represents the probability of detecting the system (i.e., the one particle) 
within the interval /, if a measurement of position is performed at time t. 
Thus 

, + °0 

P < (R 1 )= \4>{x, t)\*dx 

J —oo 

represents the probability of finding the system anywhere and, by 


# h h/lir, h is the Planck constant; numerically h 1.054 X 10 117 erg sec. 
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definition, this has to be equal to one.* In view of this interpretation 
the necessity of (5.2) being fulfilled at all times is obvious. 

As we shall see in Chapter IV, another consequence of Born’s inter¬ 
pretation is that whenever ¥*( t) represents a physical state and c is a 
constant for which | c | = 1 , the vector-valued function (c^t)) represents 
the same physical state. 

5.2. The Wave Mechanical Initial-Value Problem 

In the case of classical mechanics the basic problem which is encoun¬ 
tered is the initial-condition problem: given the position x 0 and velocity 
v 0 of the system at some initial time t 0 , find the state of the system at all 
times; i.e., find the unique trajectory #(£) which satisfies the initial 
conditions 

x(t 0 ) = # 0 , x(t 0 ) = v 0 (x = dx/dt). 

In practice this problem reduces to solving the initial-condition problem 
of the differential equation ( 5 . 3 ), which is of the second order in the 
variable t. 

The analogous initial-value problem in the quantum mechanical case 
consists in giving the state i/j 0 (x) at some time t 0 , and requiring to find a 
state i/f(#, t) at all times £, which is such that 

(5.5) *A(#j *o) = 0o(*)- 

Since the dynamics is now given by the Schroedinger equation (5.4), 
which is of the first order in £, the above initial-condition problem has a 
unique solution if i/j 0 (x) g ^ 2 >(^ 1 )- 

The general procedure of solving the above problem consists in 
reducing it to an ‘‘eigenvalue” problem. One tries first to find a solution 
i/j(x, t) of the form 

(5.6) if*(x, t) = ip(x) exp[— (i/ft) Et], 

where E is a yet undetermined constant. When (5.6) is inserted in (5.4), 
we get a so-called eigenvalue equation for i p(x) and E 

(5 - 7) ~ + V{x)Kx) = mx) ’ 

# A somewhat naive interpretation of the statement that an event E (in this case the 
event is the detection of the system within the interval I at time t) has the probability P y 
ih the following “frequency interpretation”: if the same experiment (in our case, the 
measurement of the position of the particle) is repeated under identical conditions a large 
number N of times, then the number of times the event E occurs should be v & PN> 
i.e., P w v/N, where the approximation « should be increasingly better with increasing 
N (on different concepts of probability see, e.g., Pap [1962], Fine [1973]). 
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which in physics is called the time-independent Schroedinger equation. 
This equation has in general a family <^ b of functions i )j(x) e ^g)^ 1 ) as 
solutions for a set S p of values of the parameter E. The numbers belonging 
to the set S p are called, from the general mathematical point of view, the 
eigenvalues of the differential equation (5.7). 


Theorem 5*1 ♦ If the functions i/q(#), $ 2 ( x ) an d their first derivatives 
<h/j 1 (x)/dx, difj 2 (x)ldx y as well as V(x ) i/q(#) and V(x) i/j 2 (x) y are from 
then 


(5.8) 



h 2 d*Ux) 
2m dx 2 

h 2 d^fx) 
2m dx 2 


V(X)M*)) I«*))• 


If each solution i p(x) of (5.7) has the property that i/j(x ), di/j(x)ldx> 
V(x ) i p(x) e < ^| 2 )(^ 1 )> then each eigenvalue E of the time-independent 
Schroedinger equation (5.7) is a real number, and if i/q(#) and i/j 2 (x) are 
two eigenfunctions of (5.7) corresponding to two different eigenvalues 
E ± 9^ E 2 , then i/f x ( x ) and i/j 2 (x) are orthogonal. 

Proof. Integrating by parts twice we get 


(5.9) 


fV(- 

J -a N 


2m 


h 2 d 2 ijj 2 
2m dx 2 

*# 


+ V^) 


dx 




d'I’i* i i+a ' 


MT(- 


h 2 d Vx* 
2m dx 2 


+ v<p 1 *)h 


dx. 


When we let above a—> + oo, we obtain (5.8) because both integrals 
converge due to the fact that i/q(#), *A 2 (#), difj 1 (x)jdx y di/j 2 (x)ldx , F(#) i/q(#), 
V(x) i/j 2 (x) e ^J 2 )(^ 1 )> while the first two terms on the right-hand side 
of (5.9) vanish due to the fact that, by definition, any function f(x) in 
^ 2 )(^ 1 ) has to vanish at infinity together with its first derivative. 

If i/q(#) and i/j 2 (x) satisfy the conditions of the theorem so that (5.8) is 
true, and if in addition they are solutions of (5.7) corresponding to 
eigenvalues E 1 and E 2 , respectively, then we can write (5.8) in the form 

(5.10) ^0i(tf) I E 2 ifj 2 (x)y = ^£i0i(x) | 0 2 (#))* 

By taking in (5.10) i/q(tf) = i/» 2 (#) = i/»(#) and consequently E t = E 2 = 
E y we obtain 


E<m i m> ~ i ^ x y>. 
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If ifj(x) is a nontrivial solution of (5.10) [i.e., if ifj(x) ^ 0], we have 
<i/r | i/r> > 0 and therefore we must have E — £*; i.e., we have proved 
that the eigenvalues of (5.7) are real under the specified conditions. 

In order to prove the last point of the theorem, namely that p 1 _[_ i p 2 
when E 1 ^ Z? 2 , it is sufficient to note that due to the fact that E 1 and E 2 
must be real numbers, we can write (5.10) in the form 

(E 2 — | ifj 2 (x )> = 0. 

Since E 1 E 2 , the above implies that <i/q(#) | i/^M) — 0* QE.D. 

We shall establish in Chapter IV that the above theorem is true under 
more general circumstances than the ones mentioned so far. 

5.3. Bound States of the System 

From the physical point of view the wave functions in <? b are considered 
to be bound states* and the numbers in S p are taken to be (as a special case 
of Born’s correspondence rule given in Chapter IV) the only possible 
energy values that a system in a bound state can assume; they are there¬ 
fore said to be the energy eigenvalues of the bound states , and the set S p 
is called the point energy spectrum. 

We note (see Exercise 5.3) that for a given eigenvalue Eg S p the 
corresponding set of eigenfunctions is a linear subspace M £ of ^f 2 )(^ 1 )* 
If the subspace M E is one dimensional then the energy eigenvalue E is 
called nondegenerate ; otherwise E is said to be degenerate. 

It is easy to see that the Euclidean space ^J 2) ((^ 1 ) is not complete (see 
Exercise 4.2). According to Theorem 4.1 there is a completion of ^^(If^ 1 ) 
which we denote by Jf 7(1) . We shall establish in Chapter II that Jf {1) 
is a separable Hilbert space. It is assumed that each vector of Jf 7(1) can 
represent a physical state at a certain time. 

We shall limit ourselves in the rest of this section to the closed subspace 
.^ 1) spanned by + $ h . By definition, each one of the vectors in <o h 
represents a bound state at a given time. As is a subspace of J^ {1) 
which is separable, it has to be also separable according to Theprem 4.2. 

Each one of the closed subspaces M £ of corresponding to the 

eigenvalue E must also be separable. Consequently, according to 

* Classically, a particle moving in a central force field is said to be in a bound state if 
its trajectory is a closed curve. 

1 To simplify the notation we denote by the same symbols the incomplete Euclidean 
spaces and their isomorphic images in their completion; we do the same with the elements 
of tliese spaces and the image points of these elements in the completion. 
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Theorem 4.5 there is in each M E an at most countably infinite orthonor¬ 
mal system T E spanning M £ . The closed subspace spanned by 

T = U 

Eg Sp 

is obviously identical to The set T is an orthonormal system 

because each T £ , E e S p , is an orthonormal system, while for E 1 E 2 
the sets M £i and M £g , and therefore also T Ei and J Ez , are orthogonal. 
Since all the vectors in T are linearly independent (see Exercise 4.12) 
while is separable, it follows (see Exercise 5.5) that T is a countable 
set. 

In practice the elements of each T E can be chosen to belong to ^J 2 )(^ 1 ) 
and therefore satisfy (5.7). Consequently, if we write T in the form 

the vector W k e will be related by the construction outlined in 
proving Theorem 4.1 to a wave function i/j k (x) e satisfying the 

time-independent Schroedinger equation 

- £ + n*) M*) = E k Ux). 

As T is an orthonormal basis in we can expand according to 

Theorem 4.6 every vector in in the form 

** = £ <^|^>^. 
k=1 

It must be remembered that the precise meaning of the above expansion 
is 

lim W — £ iW k \W^W k = 0. 

We claim that the general solution of the initial-value problem for 
bound states, when the intial state at some time t 0 is represented by a 
vector Wq e is given by 

(5.11) ¥{t) = £ exp [- J E k (t - <„)] <¥ k | ¥«> W k . 

The above scries is again required to be a limit in the norm of its partial 
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sums; E k is the eigenvalue corresponding to the eigenfunction 
if* k (x) e which represents the vector W k . 

First we have to establish that the series in (5.11) really converges for 
every t to an element W(t) of 

Theorem 5*2* For any fixed t the sequence t ), 0 2 (£),... 

*n(t) = t C k (t) W k , 
k=l 


c k (t) = exp [- i E k (t - i 0 )] <W k I y 0 >, 

is convergent in the norm to some vector W{t) For t — t 0 its 

limit ^P(t 0 ) satisfies the initial condition *F(£ 0 ) = *P 0 . 

Proof. As W 1 , W 2 ,... is an orthonormal basis in we have 

according to Theorem 4.6 

Z I c*(*)l a = Z I W = II ^0 II 2 < +QO. 

*=1 fc=l 

Thus, the sequence 0 1 (t), is a Cauchy sequence, and consequently 

it has a limit W(t ), since is complete. As c k (t 0 ) = ( x P k \ 

k = 1, 2,..., are the Fourier coefficients of W 0 , we get from Theorem 4.6 

n*o) - • q.e.d. 

In order to justify* the statement that (5.11) represents the solution 
of the initial value problem, assume that the series 

(5.12) £ exp [ - ^ E k (t - *„)] c k $ k (x), c k = <W k \ W 0 >, 

converges in the norm as well as point by point for every value of x and t 
to a limit function i )j(x, t ), and that d 2 ifj(x, t)/dx 2 and difj(x, t)ldt can be 
obtained by differentiating the series (5.12) term by term twice in x and 
once in t, respectively. If we insert this i p(x, t) into the Schroedinger 
equation (5.4) and interchange the order of summation and differentiation, 
then we easily find that i/j(x, t) satisfies (5.4) by taking into consideration 
that each f k (x> t) satisfies (5.7) for E = E k . 


# A complete and general proof will he given in Chapter IV. 
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5.4. A Particle Moving in a Square-Well Potential 


In order to provide a simple example of the procedure outlined above, 
let us consider the case of a particle moving in a so-called square-well 
potential F(#), given by 

K(«) = l° for 

' ' \V 0 > 0 for x <0 and x > L. 

This potential represents the idealized case of a force which is every¬ 
where zero except at the “walls” x = 0 and x — -\-L of the “potential 
well,” where it is infinite. The time-independent Schroedinger equation 
(5.7) becomes in this case 

*y&+*[Z-Et(x)= 0 , 

( ] + =°, *<0, X>L. 

The general solution of the system (5.13) is 


ce ikx + de~ ikx , 
ijj(x) — | a 1 e~ ik ' x + b 1 e ik ' x , 
a 2 e ik " x -|- b 2 e ~ ik " x , 


*=(“f • 

* - 

k „ = j- 2 m(E - V 0 ) j 


1/2 


1/2 


X < 0 


x > L. 


The above expressions should represent the same wave function 
in the three different regions. Due to the normalization condition (5.2) 
a necessary condition for this is that ifj(x) — > 0 as | x | -> -foo. This is 
possible only if k' = k" is a pure imaginary number: 

k' = k” = I*, ic - (l/*)[2m(K 0 - E)] 1 '* > 0 
and b 1 ~b 2 — 0. 

Since i/j(x) e ^g)^ 1 )* i l J ( x ) also has to be continuous and have a 
continuous first derivative at x — 0 and x — L; i.e., we have to impose 
the following conditions: 


(5.14) 


’AC*') I cc—>—0 

dip I 


dx 


0 


<l>(x) lawt-o 
dijj 

dx 


a i — ^{x) las-M-0 
difj 
dx 


C + dy 

= ik(c — d), 


I £C—»-f-o 

= ce ikL + de~ ikL = ifj{x) \ x _ Lh) 

dijj 
dx 


doe 


= ik(ce ikL - de~ ikL ) 


«!->/*) 0 


—kL 

■~a 2 t<e kL , 
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By eliminating from the first pair and a 2 from the second pair of 
equations (5.14) we are led to the following linear system of equations 
for c and d : 

(k — ik)c + (k + ik)d = 0, 

(5.15) 

(k + ik) e 2lkL c + (k — ik)d = 0. 

In order to have a nontrivial solution (| c \ -f | d \ > 0) of the system 

(5.15) , the determinant 


(5.16) 

must vanish. Since 


k — ik k + ik 

(k + ik) e 2iJcL k — ik 


(5.17) k = (1/h) V2mE , * = (1 /k) V2m(V 0 E), 

and ft, niy and V 0 are fixed constants of the problem, the condition that 
(5.16) should be zero is a transcendental equation 


< 5 - 18 > - (frir) 

to be satisfied by E. As we have 

I LT - ll? I 

1 , 

I k -f- ik I 

we can write 


k —|- ik I 

k — ik 
k -f- ik 


where 


(5.19) 


K*~k* E 

cos <f> = ——rs- =1—2 




sin (f> = — 


2t<k 


Vo ’ 
E 


K 2 + k 2 


=-[i( 


E \i 1/2 


r;)l 


The relation (5.18) is equivalent to 

$ = kL = (1 IH)V2mE, 

where (5.19) is determined by (5.18) up to a multiple of 2t t, i.e., 
<l> = arc cos — 2 H 27t//, n = 0, ±1, ±2,... . 
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Thus, we have an entire family of eigenvalues E n , constituting the point 
spectrum of energy levels of a single particle moving inside the square- 
well potential. The energy values E n can be obtained by solving numeri¬ 
cally the equation 

^ V2mE = arccos ^1 — 2 

for each n. Each eigenvalue E n is nondegenerate, and the corresponding 
eigenfunction i/s n (x) is 


(5.20) 



2 ik n 


+ ik 


x < 0 




K n + ik n 
^ik n x _ ^ 

( p (x n +ik n )L _ K n l ^n ( K n -ik n )l\ -K n x 

\ x n + ik n r ’ 


x ^ L, 


where the expressions for k n and K n are obtained by taking in (5.17) 
E = E n . The absolute value | c n | of the constant c n has to be determined 
from the normalization condition 

/»+°° 

I I W*)l 2 <** = 1. 

The remarkable feature of the eigenfunction (5.20) is that $ n (x) does 
not vanish outside the region 0 ^ x < L. This means, in view of the 
physical interpretation of | i/j n (x)\ 2 , that there is a nonvanishing proba¬ 
bility of finding the particle outside the potential well—a typical quantum 
mechanical phenomenon which is not met in the classical treatment of 
the same problem, where the particle in a bound state is trapped inside 
the potential well. 

Exercises 

5*1 ♦ Check that the set ^ n ([—a, +#]) of n-times continuously differ¬ 
entiable functions f(x) defined on the finite interval [— a , -\-d\ becomes 
a Euclidean space if the vector operations are defined by 

(af + bg)(x) = af(x) + bg(x) 

and the inner product by 

</ l£> = f' /*(*) g{x)dx. 

J -a 
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5*2* Show that the set ^( 2 )(^ 1 ) of once continuously differentiable 
functions f(x) that vanish at infinity together with their first derivatives 
and are square-integrable is a complex Euclidean space if the vector 
operations are defined for f>ge by 

(/ +g)(*) =/(*) +£(*)> 

(af)(x) = af(x) 9 a e C 1 , 
and the inner product by 

r +0O 

</ I g> = f f*(x) g(x) dx. 

J —00 


53 * Show that if i/j^x), $ 2 {x) e ^2)(^ 1 ) are eigenfunctions of (5.7) 
corresponding to the eigenvalue E, then for any given complex numbers 
a and b the function ai/j^x) + b\fi 2 {x) is also an eigenfunction with the 
eigenvalue E. 

5*4* A linear harmonic oscillator in one dimension is a particle of 
mass m whose potential energy is 

V(x) = \moj 2 x 2 9 

where o> denotes in classical mechanics the characteristic frequency of 
oscillations which the particle performs. Show that in the quantum 
mechanical case the energy spectrum S p of bound states is nondegenerate 
and equal to 

S p = {\hco 9 f£co,..., (n + J) ha> 9 ...}. 

Verify that the energy eigenvalue E n — (n + i) corresponds to the 
normalized eigenfunction 

$ n (x) = (mo>/7r^) 1/4 (2 w «!) -1/2 exp[— (nuoj2h)x 2> '\ H n |# ]> 

where H n (u ), n = 0, 1, 2,..., are the Hermite polynomials 
//» = (- 1 )»^^ (,-«*) 

= ( 2„)n _ 2 )(” - 3 ) (2u) n ~ i - .... 

5*5* If T is an orthonormal system in a separable Hilbert space, 
prove that T is a countable set. 
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References for Further Study 

On finite-dimensional vector spaces: BirkhofF and MacLane [1953], and Nering [1963]. 
For §§2-4 consult Dennery and Krzywicki [1967], von Neumann [1955, Chapter II], 
and for a more advanced approach, Naimark [1959, Chapter I], For §5 see Landau and 
Lifshitz [1958], Messiah [1962], or any other of the many standard textbooks on quantum 
mechanics. 



CHAPTER || 


Measure Theory and Hilbert Spaces of Functions 


In the last chapter we met examples of Euclidean spaces whose 
elements were square-integrable functions. These spaces were not 
complete, and in order to find their completion we had to employ the 
constructions outlined in §3 and Theorem 4.1 of Chapter I. We can 
embed, however, Euclidean spaces of functions of a very general type 
in Hilbert spaces whose elements are also functions. In order to study 
these spaces it is necessary to familiarize ourselves with the basics of 
the modern theory of integration in general, and the theory of the 
Lebesgue integral in particular. 

In § §1, 2, and 3 we define and study the basic constituents of measure 
spaces in order to be able to define in §3 integration on such spaces. 
The knowledge of the methods used in proving the theorems formulated 
in these three sections is not absolutely essential for an understanding 
of the rest of the material in this book. Consequently, at the first reading, 
the reader may skip the proofs and limit himself to reading the various 
definitions and theorems in order to familiarize himself with the basic 
concepts and results. 

In §4 we introduce Hilbert spaces of square-integrable functions and 
in §5 we study the application of the theory of these spaces to the wave 
mechanics of n particles of different kinds. The introduction of the 
concepts of direct sums and tensor products of Hilbert spaces in §6 
proves desirable before a more detailed study of the two-body bound- 
state problem is carried out in §7. 
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Measurable Spaces 

1.1. Boolean Algebras and <t Algebras of Sets 

While discussing the physical interpretation of wave mechanics, we 
have seen that for each interval I 

(1.1) * P,(I) = fMx,t)\*dx 

is the probability of finding the system [which is in the state £)] 
within / at time t. The formula (1.1) provides an example of a real 
function whose domain of definition is a family of sets. Such a function 
is called a set function. In §2 we shall be concerned with a special 
class of set functions, which are called measures , and which are basic in 
the modern theory of integration. In order to define the concept of 
measure we need first the concept of measurable space. 

In case of measurable spaces we deal again with a set SC in which a 
certain type of structure is specified. The structure is now determined 
by giving a family of subsets of SC which satisfies certain axioms, and 
which is then called a Boolean a algebra. 

Definition l A* A nonempty class JT of subsets of a set SC is called 
a Boolean algebra (or field or additive class) if we have 

(1.2) R\J S e JT, whenever R, S e JT, 

(1.3) R' e JT, whenever R e JT, 

where R' = SC — R denotes the complement of R with respect to SC 
(i.e., the set of all elements of SC which do not belong to R ).* The class 
CtiC of subsets of SC is called a Boolean o algebra (or a field , or completely 
additive class) if in addition to being a Boolean algebra it has the property 
that the union U/£=i S k of any countable number of sets S 1 , S 2 ,... from 
JT also belongs to 

Theorem 1*1* If the class JT of subsets of a set SC is a Boolean 
algebra, then 

(a) the entire set SC and the empty set 0 belong to JT, 

(b) the intersection R n S belongs to Jf whenever R, S e JT, 

(c) the difference R — S and symmetric difference, R A S = 
(R — S) U (S — R) belongs to JT whenever R, S e C/C. 

# We prefer the name Boolean algebra to that of field because thus no confusion with the 
concept of field of scalars can arise. 
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The fact that SC e Jf' if Jf' is a Boolean algebra follows from the 
requirement that there is a subset R of SC which belongs to . Conse¬ 
quently, R' e JT and therefore SC = Ru R' e JT, as well as 0 — SC't CtiS 
In order to prove the other two statements of Theorem 1.1 we need 
to establish some of the general rules of dealing with sets. 

Lemma LL If ^ is a family of sets, and R is any given set, then 

fin fu5)=u^4 

\Se^ / Se^r \ 1 

Proof. If £ e R n (Use.# S) then by definition £ e R and £ e Use# S , 
i.e., | belongs to at least one set S x from F. Thus, £ e R n S x , S ± e 
and consequently £ e Use#(^ n S). 

Conversely, if rj e Use#CK n *5), then rj belongs to at least one of the 
sets (R n S), i.e., rj e i? and rj e Use# *5- Thus, rj e R C\ (Use# *5). 

Q.E.D. 

It is interesting to note that if we look upon the operation U as set 
addition and n as set multiplication, then we obviously have that these 
operations are commutative 

Rv S = S v R, Rr\ S = S n R, 

and associative 

R u (S u T) = (R u 5) u T, i? n (5 n T) - (i? n 5) n T, 

like the ordinary addition and multiplication of numbers. Furthermore, 
as follows from the above lemma, they are also distributive: 

Rn(SuT) = (Rn S) u (Rn T). 

Lemma 1.2* If F is any family of subsets of a set SC, and if for 
any given set S we denote by S' the complement of S with respect to SC, 
then 

(i.4) (us)'=ns', (ris)'=[js'. 

\Se# / Se# \Se# / Se# 


Proof. To establish the first relation, assume that £ is an element of 
(Use# S)' . Then £$ Use# S, which means that £ does not belong to 
any of the sets S g F, i.e., £ g S' for all S e F; thus £ g f|se# S'. 

Conversely, if rj e f)se# S', then r] g S' for each S g i.e., 77 ^ S for 
all 5 G F. Therefore, rj $ Use# S, which implies that 77 is an element of 
(Us©.# S)', thus proving completely the first of the relations (1.4). 
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We obtain the second of the relations (1.4) by taking in the first of 
the relations (1.4) the complement of the left- and right-hand side, 
and noting that for any subset R of ( R')' — R" = R. Q.E.D. 

It is now easy to finish the proof of Theorem 1.1. In order to prove 
Theorem 1.1(b), write 

R n S = (R' u S')'. 

Since we have R, S e Jf, we also have by (1.3) R', S' e JT, and conse¬ 
quently from (1.2) we get R' U S' e Jf\ 

Similarly we get 

R - S = R n *S" = (R' u S")' = (R' u S)' e JT, 

by noting that R — S — R n S'. The result that R — S, S — R e JT 
whenever R, S e JT yields 

R A S = (R - S) u (5 - i?) e JT. 

Theorem 1*2* For any given nonempty family ^ of subsets of a 
set there is a unique smallest* Boolean algebra s#{F) and a unique 
smallest Boolean or algebra stfJfF) containing and stfJfF) are 

called, respectively, the Boolean algebra and the Boolean or algebra 
generated by the family F. 

Proof. We give the proof for the proof for the case of stf a (F) 

is completely analogous. 

Denote by g the family of all Boolean algebras sf containing F, 
i.e., if srf e g then F C j/; g is not empty because it certainly contains 
the family 6^ of all subsets of 3C y which is obviously a Boolean algebra. 
The family 

st[F) = 0 6 

is a Boolean algebra; namely, s/(F) is not empty since it contains F y 
and if R, S e s/(F), then for each SgJ we have i?uS, and 

consequently R U S', R'gs/(F). In addition s/(F) is the smallest 
Boolean algebra containing since according to its construction, it lies 
within any other Boolean algebra containing F. 

There is only one smallest Boolean algebra .90(F) containing 


* The set S belonging to a family & of sets, is said to be the smallest set in that family & 
if it is contained in any other set belonging to &. 
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SF ; namely, if and s/ 2 (^) were two such algebras, then 

n would also be a Boolean algebra containing &. That 

is possible only if = s/ 2 (^). Q.E.D. 

1.2. Boolean Algebras of Intervals 

For our purposes the most important Boolean algebra is the one 
generated by the family J> n of all intervals—including the degenerate 
intervals, which contain only one point, and the empty set 0 — in the 
n-dimensional vector space ( U n ). 

Theorem 13* The family of all unions 

/jU & = 1, 2,... 

of intervals in is identical to the Boolean algebra generated 

by J* n . 

Proof. We obviously have SS^Cs^iW 1 ). Therefore, in order to 
show that = ^(W 1 ) it is sufficient to prove that SBq is a Boolean 
algebra; namely, if we had that result established, then SSq would be a 
Boolean algebra containing J > n , and since s/(U n ) is the smallest Boolean 
algebra containing J rn , we would also have I) ^(IR 71 ), i.e., 

SSI = ^(IR 71 ). 

If R, S g SB o, we can write 

R=fu - u J*, S = /iU-u/„ 

where,..., / z e ^ n . Hence 

i? u S = u ••• u/ fc u /j. u ••• u JjgJJ. 

To prove that R' e we proceed by induction in ft. In the case 

ft — 1 we have R = I x is an interval, and it is easy to check that If can 
be written in the form 

If = / (1) u ••• u jw P)./ W e/ w , v < 2w3 w -\ 

i.e., jR' e Assume now that the statement R ' e * s true f° r a ll ^ 

of the form I x U ••• U I k , i.e., that we can write 

(1.5) (4 u - u 4)' = U Jm , Ji /„ e./». 

m=l 

'L'hen in the case k 1, by writing 

(1.6) 4 + i = ; a) u-u/w 


/“>.; w 6/*, 
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we get from (1.5) and (1.6), with the help of the Lemmas 1.1 and 1.2, 

(h u - u h+i)' = (A u ••• u i k y o r k+1 

= 0 (J a) " J m )1 u -.u f 0 (/<-> n / m )|. 

-m=1 J Lm=l J 


Since / (1) n J m ,..., / (v) n J m E^f n y we conclude that (/j U***U / fc+1 )' 

Q.E.D. 

Notice that in constructing &Q>it would have been sufficient to 
consider only disjoint finite unions of intervals (see Exercise 1.5). 

1.3. Borel Sets in U n 

A set belonging to the Boolean a algebra $0 a (J> n ') generated by J n 
bears the name of a Borel set in n dimensions. We cannot generalize 
Theorem 1.3 to Borel sets by saying that every Borel set is a countable 
union of intervals, because there are Borel sets which are not that. An 
example of such a Borel set in ^(R 1 ) is the set 3 of all irrational numbers; 
3 is a Borel set because it is the complement of the set of all rational 
numbers, and 91 is the countable union of all degenerate intervals 
containing only one rational number, and therefore is a Borel set. 

The fact that the family of 3% n of all Borel sets in U n is very large, 
though by no means contains every subset of (R n , is confirmed by the 
following statement. 

Theorem L4* Every open and every closed set in the Euclidean 
space R n is a Borel set. 

Proof . Assume that O is an open set* in U n . For every natural 
number m we consider the following open intervals in U n 


r(m) 
ki . 



k 1 — 1 + 1 

m m 

k l 5 * * • 5 kn lj 2 ,..., 





+ 1 
m 


which obviously cover U n . The union 


R(m) 


(jlm) 

V l k x . k n 


ll’"L.K CO ’ k i .*« = 1,2,...} 


* O is an open set in a metric space if for every point sceO there is an • neighborhood 
of x which lies within O. 
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of all intervals lying within O is evidently countable. Further¬ 

more 

00 

O = (J 

m=1 

namely, if * e 0 is any given point of O, then there is an e neighborhood 
of x contained in 0, and consequently for a sufficiently large m, m > 2n/e, 
there will be an interval containing x and lying within that 

€ neighborhood. 

Since we have written O as a countable union of Borel sets R {m \ it 
follows that O is a Borel set. 

Any closed set C in R n is the complement (with respect to R n ) of 
the open set C' — O. Since O e it follows that C — O' e 3$ n . Q.E.D. 

Given the Boolean algebra generated by some family we 

can construct the Boolean or algebra generated by S' y by means 

of monotone sequences of sets from 

1.4. Monotone Classes of Sets 

Definition 1*2* An infinite sequence S ± , S 2 ,... of sets is called 
monotonic ally increasing (or expanding) if S x C S 2 C •••, and monotonically 
decreasing (or contracting ) if S x D S 2 D • • • ; in the first case we write 

lim S k = (J S k , 

w 

while in the second case we write 

lim S k = Q S k . 

A nonempty class J(T of subsets of a set SC is called a monotone class if 
every monotone sequence S x , S 2 ,... e Jf has its limit in JT, i.e., 
lim fc _ )00 S k g erff '. 

Theorem 1*5* If ^ is a nonempty family of subsets of a set 
there is a unique smallest monotone class 9Jt(JF) containing which is 
called the monotone class generated by SP . 

The proof of the above theorem can be carried out in the same 
fashion as the similar proof for in Theorem 1.2, by noting that 

the family of all subsets of SC is a monotone class. 

^Theorem L6* Every Boolean a algebra is a monotone class, and 
every Boolean algebra which is a monotone class is a Boolean a algebra. 
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Proof, (a) If si a is a Boolean a algebra and R x , R 2 ,... e si a is mono- 
tonically increasing, then by Definition 1.1 of a Boolean or algebra, 

00 

(1.7) lira R k = (J R k e . 

k ■*" *=i 

In the case that , S 2 is a monotonically decreasing sequence in si a , 
then S x — Si , S x — S 2 ,... is a monotonically increasing sequence in 
si a , and we have according to (1.7) (see also Exercise 1.3) 

S x — lim S k = lim^j. — S k ) e si a . 

k->co fc-*OC>' 

Therefore 

lim S k = S x (S x lim S k ) £ si a . 

«->00 & H >00 

(b) If si is a Boolean algebra and a monotone class, and S x , S 2 
is an infinite sequence of sets from j/, then R x , i? 2 

n 

R n ^[jS k , * = 1.2,..., 

fc=l 

is a monotonically increasing sequence in si. Since si is a monotone 
class, we have 

00 

U s k = lim £ ja/, 

and therefore si is a Boolean cr algebra. Q.E.D. 

^Theorem 1*7* If si is a Boolean algebra and 9Jt(j/) is the mono¬ 
tone class generated by si , then 9Jl(j/) is identical with the Boolean 
cr algebra sifsi) generated by the family si of sets. 

Proof. Since, according to Theorem 1.6, sifsi ) is a monotone class, 
while by definition 1 (j/) is the smallest monotone class containing si , 
we must have 

(1.8) W(si)Csi 0 {si). 

Since si a {si) is the smallest Boolean cr algebra containing si , by proving 
that 9Jt(j/) is a Boolean cr algebra, we could also deduce that 

si a {si) c m{si\ 

which would imply in conjunction with (1.8) that si a {si) ~~ s M{si). 
Now, in order to prove that s Bl(,c/) is a Boolean cr algebra, it is sufficient, 
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due to the second part of Theorem 1.6, to establish that is a 

Boolean algebra. 

For a given R e denote by the family of all sets S in9t 

which are such that S', R U S e The family is a monotone 

class. As a matter of fact, if S 1 , S 2 ,... e is a monotone sequence, 
then since W^{^/) is a monotone class (see Exercise 1.6), 

(lim S n )' = lim S n ' e 5 CR(j2/), 


while the sequence of the sets R U S x , R U S 2 ,... e5CR(j/) is also a 
monotone sequence and we have (see Exercise 1.6) 

(lim S n ) U R = lim(£ w Ui?)e 5 CR(js/). 


In addition, if Restf, then 9l(i?) is certainly not empty because it 
contains j/. Since SCR^) is the smallest monotone class containing stf, 
we must have 9l(i?) == 9 JI(j/). Thus, 9l(^") = $R(j/), and therefore 
S' e s IR(j/) for every S e 9l(^") = 9Jt(j/). Furthermore, if R e j/, then 
for an arbitrary S e9Jt(j/) we have S e 9l(i?) = $R(j/), i.e., i?uS = 
S KJ R e5CR(j/); consequently, we also have R e 91(5) for any e 
i.e., 9l(S) I) jaf. Since 9l(S) is a monotone class, while 9Jt(j/) is the 
smallest monotone class containing s/, it follows that 9l(*S) = 5 CR(j/) 
for arbitrary S e 9Jt(^). This means that R U S, R' e 5 CR(j/) whenever 
R, S e 9l(S) = 9Jt(j/), i.e., $R(j/) is indeed a Boolean algebra. Q.E.D. 

Definition 1*3* The ordered pair {SC, <stf) consisting of a set SC and 
a Boolean or algebra s# of subsets of SC is called a measurable space . 
Every set S' belonging to is said to be a measurable set in the measurable 
space {SC, 

By following the tacit convention adopted from the beginning with 
regard to mathematical spaces, we shall denote the measurable space 
{SC, stf), consisting of the set SC and a structure on SC given by the 
Boolean or algebra s/, also by the letter ^—except where ambiguities 
might arise. Thus, due to the properties of Boolean or algebras, we can 
say that if R is a measurable subset of SC, then R' is also measurable, 
and if , S 2 ,... is a countable family of measurable subsets of SC, then 
their union U£Li S k is measurable in SC. 

Exercises 

LI* Show that if R and S are any two sets, then 
RnS = {Ru S) - {R A S). 
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L2* A nonempty class X of subsets of a set SC is called a Boolean 
ring (or additive class of sets) \{ Ryj S, R — S g X whenever R, S e X; 
a Boolean ring is a Boolean or ring if U S 2 U • • • e whenever 
S-l , S 2 ,... G Explain why every Boolean algebra (or algebra) is a 
Boolean ring (or ring). 

L3* Show that if 0t is a Boolean ring (see Exercise 1.2) of subsets 
of the set SC, then R — S, RAS, RnSeg# whenever R, S g Sflt. 

I A. Prove that if SC is a family of subsets of a set SC, there is a 
smallest Boolean ring and a smallest Boolean or ring 

containing !F\ these rings are said to be generated by . 

1*5* Show that the family SSq of all unions of intervals is identical 
with the family of all disjoint unions of intervals. 

1*6* Show that if S x , S 2 ,... is a monotone sequence of subsets of a 
set SC, then for any given set R, the sequences R U , R U S 2 ,..., 
R n S x , R n S 2 ,... and Sf, S 2 \... are monotonic and R U lim^^ S k = 
lim k _> +ao (R U S^), n lim fc _> +00 S k = lim fc _^ +00 (lim fc _> +00 — 

lim fc _> +00 . 

1*7* Show that Theorem 1.5 is true not only for Boolean algebras, 
but also, more generally, for Boolean rings. 

L8* Prove that if si is a Boolean algebra of subsets of SC, and R is 
any given subset of SC, then si R = {R n S: S g si] is a Boolean algebra 
of subsets of R. Formulate and demonstrate the same result for Boolean 
or algebras. Show that if si generates the Boolean or algebra si a of subsets 
of SC, then si R generates the Boolean a algebra si a R = {R n S: S g si a } 
of subsets of R. 

L9* Let si be a Boolean or algebra and S x , S 2 ,... a countable 
family of sets from si. Show that HfcLi belongs to si. 

*2. Measures and Measure Spaces 
2.1. The Concept of Measure 

In the last section we mentioned the concept of a set function which is, 
in general, a mapping of a family X of sets into IR 1 or C 1 . A set function 
F(S), S g X, is said to be additive if for any disjoint S x , S 2 e X 

F(S, U S 2 ) == F(S X ) I. F(S 2 ); 




2. Measures and Measure Spaces 


67 


it is said to be or additive or countably additive if for any countable 
family S x , S 2 ,... of disjoint sets in CCC we have 

F(S X u S a u ■■•) = F(S X ) +F(S 2 ) + - . 

Definition 2*1 ♦ A measure n(S) is an extended real-valued set 
function whose domain of definition is a Boolean algebra si, and which 
satisfies the following requirements: 

(1) M (0) = O, 

(2) ii(S) > 0 for all Ses/, 

(3) MULi S fc ) = ZLi M(Sfc) if the sets , S 2 are disjoint, i.e., 
if n Sy = 0 when i ^ j. 

We see that a measure is an extended real-valued nonnegative or-addi- 
tive set function. The adjective “extended” indicates that for some sets 
S £ si we might have fx(S) = -f oo. The symbol -f°o is thus formally 
adjoined to the set R 1 , and in the extended set R 1 U {-f- oo} we define 
the formal operations a-f-oo — oo, a • oo = oo if a > 0, and 0 • go = 0, 
a/co = 0, for all a e R 1 KJ {-f-oo}. 

Definition 2*2* A measure space {SC, si, fi) is a measurable space 
{SC, si) for which a measure (jl{S) is defined on the Boolean or algebra si. 

For any S e si, the number /x(S) is said to be the measure of the 
set S . In the case that /x(S) < +oo, S is said to have a finite measure. 
If the set S' is a countable union Un=i $n °f sets S n of finite measure, 
then the measure (jl{S) of S is said to be or finite. A measure (jl{S) on si 
for which the set SC has a finite (or-finite) measure is called a finite 
(or finite) measure; the measure space {SC, si, fi) is then also called finite 
(or finite). 

As a simple example of a measure whose domain of definition is 
(see §1), consider the set function /x ( i 1) (jB) which for an interval I (open 
or closed) from a to b on the real line is equal to the length b — a of 
the interval. Since any element B e can be written as a finite union 
of disjoint intervals (see Exercise 1.5) 

(2.1) J,nJ,= 0 for 
we can define ^\ 1] {B) by 

(2.2) $(B) = $&) + • • • + fIM) > 0. 

The number ^{B) is called the Lebesgue measure of B e . It is 
obvious that the value ^\B) is uniquely determined regardless of the 
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way we split B into disjoint intervals. It is easy to see that the set function 
is or additive. Hence, if we define /x ( J 1) (0) — 0, we have a measure 
on B ) is a or-finite measure because we can write 

K 1 = Ur-ooK n+l] and the Lebesgue measure of any bounded 
interval is finite. 

The type of measure with which we shall frequently have to deal is 
the probability measure. A probability measure /x p (S) is a measure on 
a measurable space {SC , si), which assigns to each S e si a number 
which represents the probability that an event, whose possible outcome 
could be any point in SC and only a point of SC, will happen to be in S. 
Due to this interpretation it is required that a probability measure 
satisfy the “normalization” condition p v {SC) = 1; thus, every probability 
measure is necessarily finite. 

An example of a probability measure—of the type we shall encounter 
very frequently in the future—is the measure P t {B), B e S8\, defined 
on 8§\, whose value for B = / e is given by (1.1); for a general 
Be Jj, which can always be written in the form (2.1), we define (see 
Exercise 2.3) 

P t (B) = P t (/0 + - + P t (I k ). 


2.2. Basic Properties of Measures 

Definition 2*3* An extended real-valued set function F{S) is said 
to be continuous from above {below) if for every monotonically decreasing 
(increasing) sequence S x , S 2 ,... of sets from the domain of definition 
of F{S) we have 

F(lim S k ) = \jmF(S k ) 

&H>CO &H>CO 


whenever F(lim fc _* 00 S k ) is defined and, in the case that S x , S 2 ,... is 
decreasing, whenever |F(S n )| < +oo for at least one value of n (see 
Exercise 2.4). 

Theorem 2*1* Every measure is continuous from above and below. 

Proof. If R ± , R 2 is a monotonically increasing sequence from 
the measure space {SC, sf, fi) and lim^oo R k e sf, then we can write 

00 

lim R k = {R k R k _ i), R 0 = 0 , 

^°° *-i 

where the sets R x — R 0 , R 2 — R \e si {R k — R^i e by Theorem 
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1.1) are disjoint. Hence, if lim*.^ R k e j/, 



proving that /x is continuous from below. 

If S 1 , S 2 ,... e j/ is monotonically decreasing and fi(S nQ ) < -f°o for 
some n 0 , we have that S nQ — Si , S nQ — S 2 is monotonically 
increasing. According to (2.3), if Hindoo S k e j/ and therefore 

lim(S„ 0 - - lim S k e si, 

we have 

= H'i'S'nQ 

Since fx(S no — S k ) = fj,(S no ) — fj,(S k ) for k ^ n 0 (see Exercise 2.1), we 
get from the above relations 

H'i^nQ S k ) = n(S no ) lim /x(*S & ). 


Therefore, fi(S) is continuous from above, as can be seen from the 
following: 

^(jh^oo Slc ) KSn. (^n 0 *^ft)) 

= KSn 0 ) - KSn 0 - Hm S k ) = lim /*(£*)• Q.E.D. 

Theorem 2*2* Every finite, nonnegative and additive set function 
F(S) defined on a Boolean algebra s/ and satisfying F( 0) = 0, which is 
either continuous from below at every R e j/, or continuous from 
above at 0 e j/, is necessarily also or additive, i.e., it is a measure. 

Proof . Let S x , S 2 ,... be any infinite sequence of disjoint sets from j/, 
which is such that R = US£=i S k e . Then the sequence R 1 , R 2 ,... 
with 

n 

Pn ~ U 

k=1 

is monotonically increasing. Since F(S) is additive, we have 

F(Rn) = i F(S k ). 

/<:~1 
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If F(S) is continuous from below at R — Ufc=i > then 


F 



= F(lim R n ) = lim F(R n ) 

'w-» CO w—>00 


= Jim I F(S k ) = £ F(S k ), 

k =1 &=1 

which proves that F(S) is cr additive. 

If F(S) is continuous from below at 0 , then since R — R 1 , R — R 2 ,... 
is a monotonically decreasing sequence with lim n ^(R — R n ) — 0, we 
have 

0 = lim F(R - R n ) = F(R) - lim F(R n ), 

n-±<x> n-* oo 

i.e., 

F(R) = limF(*„) = £ F(S k ). Q.E.D. 

ft=l 


2.3. Extensions of Measures and Outer Measures 

We saw in §1, Theorem 1.6, that a Boolean algebra is a monotone 
class if and only if it is a Boolean cr algebra. It would obviously be con¬ 
venient if we could extend a measure /x defined on a Boolean algebra s/ 
of subsets of 9£ to the Boolean cr algebra generated by j/, and 

thus obtain a measure space (2£, fi) in the sense of Definition 2.2. 

^Theorem 23 ♦ Let jjl(S) be a measure defined on the Boolean 
algebra s/ of subsets of a given set 9C. The set function 

(2.4) /!(*) = inf I f>(S ft ): Q S k D R, , S,.... e j*!. Rest,, 

[ k=l k=l ) 

defined on the Boolean cr algebra s0 o — generated by j/, i.e., for 

all R e , is a measure on which coincides with /x(S) on 

fi(S) = p(S) 9 Sestf. 

If n(S) is a cr-finite measure, then jl(S) is also cr finite, and /X(S') is the 
only measure on which coincides with /x(S) on . The measure jl(S) 
is called the extension of /x(S'). 

Consider the extended real-valued set function 

^(R) = inf I f iJ.(S k ): RC\JS k , S, ,S .e 

( k~l /c-1 



(2.5) 
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for any given subset R of SC. Since for any such R there are countable 
families S ± , S 2 which cover R, i.e., whose unions contain R y 

we can always take the infimum (greatest lower bound) of Xj£=i p(S k ) 
for all such families, i.e., /x + (R ) is defined on the family 6^ of all sub¬ 
sets of SC, it is obviously nonnegative, and it satisfies the relation 

(2.6) 

\n=l / n=1 

for any R x , R 2 ,... e . 

In order to see that (2.6) is true, note that for each given e > 0 and 
each R n there is, according to the definition in (2.5), a sequence 
S n i, S n2 e s/ such that R n C IjLi S nk , for which 

(2.7) H- + ( R n) < Z K S nk) < V- + ( R n) + ^ • 

k=l Z 

Since {S nk , n, k = 1,2,...} is a countable family of sets from s/ and 
Un-1 R n c Un,fc=i S nk , we have by (2.7) 

(2.8) ^ iO Rn) ^ t Z K s n k ) 

\n=1 1 n=1 k=1 

^ Z (^ + (^n) + — € + Z M+C^n)- 

n =1 X Z ' n=1 

The relation (2.8) is true for an arbitrarily small e > 0 and conse¬ 
quently (2.6) follows. 

Definition 2A+ A nonnegative set function M(R ), defined on each 
subset R e 6^ of a set SC, for which M(0) = 0 and which is countably 
subadditive , i.e., for which 

/ OO \ 00 

(2.9) M I (J Sn) ^ Z M(S n ) 

for any sequence , S 2 ,... g , is called an owter measure in SC. 
A subset 5 of is said to be measurable with respect to the outer measure 
M(R), or simply M measurable , if it satisfies the relation 

(2.10) M(R) = M(R n 5) + M(R n 5") 
for any subset /? of /F. 
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We have proved thus far that /x + (jR) is an outer measure in X. In 
order to demonstrate that /x + (i?) is a measure on s/ a = we have 

to prove the next two lemmas. 

*Lcmma 2*1* If M(R), R e S# , is an outer measure in 5f, then 
the class of all M-measurable sets S e 0^ is a Boolean a algebra, 
and the outer measure M(S), S e , restricted to is a measure. 

Proof. We start by proving first that is a Boolean algebra. 

Since for any i? e6 f , 

M(i?) - M(R n #) = M(R n 0 ) + M(i? n #), 
we have 0 e . 

From the relation (2.10) it is evident that if S is M measurable, then 
S' is also M measurable. Therefore, we only have to prove that S x U S 2 
is M measurable whenever S ± and S 2 are M measurable. 

If S ± , S 2 e , then for any R e 6^ 

(2.11) M(R) = M(R n S t ) + M(R n 5/); 
since also R n , R n S'/ e 6^ we have 

(2.12) M(i? n 50 - M((fl n SJ n S 2 ) + M((i? n 5 X ) n 5 2 '), 

(2.13) M(R n 5/) = M((R n 5/) n S 2 ) + M((R n 5/) n 5 2 '). 

By substituting (2.12) and (2.13) into (2.11), and noting that 

Sf n 5 a ' = (S 1 u 5 a y, 


we arrive at 

(2.14) M(R) = [M(R nS ± n S 2 ) + M(R nS t n S 2 ') + M(R n 5/ n S 2 )] 
+ M(R n (S 1 u S 2 )'). 

Since the above relation is true for any subset R of SC^ we can replace 

R in (2.14) by R n (S 1 U S 2 ). Then the last term on the right-hand 

side of (2.14) becomes M(0) — 0, while the terms in the brackets 
remain unchanged, because we easily get by means of Lemmas 1.1 
and 1.2 

[R n (S x u 5 2 )] n S ± n S 2 = Rn S 1 n S 2 , 

[i? n (S 1 u S 2 )] n S ± n S 2 = R n 5 L n S a ', 

[R n (S t u 5 a )] n 5/ n 5 a = R n 5/ n 5 a . 
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Therefore, we have 

(2.15) M(R n (S t u S 2 )) 

= M (R S a ) + M(R r\S 1 n S 2 ') + M(i? n S/ n S a ), 

i.e., (2.14) reads 

(2.16) M(tf) = M(R n (S x u S,)) + M(i? n (S x u S 2 )’), 

proving that S 1 U S 2 is M measurable. 

In order to prove that s# M is also a Boolean a algebra it is sufficient 
to consider sequences of disjoint sets S x , S 2 ,... e ; namely if 
R 1 , R 2 e is any sequence, then it is easy to show that 

oo oo n—1 

(J R n = (J K - U rX 

n =1 n =1 x k= 1 

where the sets S n = R n - (J^i 1 R k , n=\, 2,..., also belong to s/ M , 
which was shown to be a Boolean algebra, and are obviously disjoint. 

If S x and S 2 are disjoint, we have S 1 C\ S 2 = 0, S 1 n S 2 ' = , 

Si n S 2 = S 2 , and (2.15) becomes 

M(R n (S, u S 2 )) = M{R n .S\) + M(R n ,S’ 2 ). 

By induction we get 

( 2 -17) M ^R n (Q .Spj = £ M(I? n 5,). 

From the subadditivity of M(i?) it easily follows that 

(2.18) Mpn (0 S^j > M n (Q .Sp'j 

because (U/c=i 5*.)' D (U/c=i ^/c) 7 - Since (JaLi S k is M measurable due 
to the established fact that is a Boolean algebra, we get by using 
(2.17) and (2.18) 

M(R) = M (r n (0 s t )) + M n (Q S.)') 

> £ W n .S\) + M {rc\ (p S^y 


(2.19) 
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Due to the countable subadditivity of M(R ), we have 

(2.20) m(r^ (0 $„)] = M (0 (R n S 4 )j < £ M(R n .S’,). 

Since (2.19) is true for any n, we obtain by letting n —»■ oo in (2.19) and 
using (2.20), 

(2.21) M(R) > £ M(fl n S k ) + M ^ n 

^m(rc (0 I- M (i? n (0 

Due to the subadditivity of M(R ), we also have 

(2.22) M(R) < M (/? n (Q S t )) + M n (Q 

The relation (2.21) in conjunction with (2.22) shows that S k * s 
M measurable, i.e., U/c=i Sk G ^m > an d hence jaf M is a Boolean a algebra! 

In order to prove that M(R) is a additive on , we insert in (2.21) 
R = Ufc-i S k to derive 


Wu W) > i M (s k i 

\k=1 1 k= 1 

which in conjunction with (2.9) yields the desired conclusion. Q.E.D. 

*Lemma 2*2* Every set R in the Boolean a algebra s/ a generated 
by stf is n + measurable, i.e., is contained in jaf + . 

Proof. Take any set R e S# . By the definition (2.5) of /x + (i?), for 
any e > 0 there is a sequence S ± , S 2 ,... e s/ such that R C [j k=1 S k , 
for which 

(2.23) ^(R) < £ n(S k ) < ^(R) + e. 

k=l 

For any set S e jaf we have [since S k = ( S k nS)u ( S k n S'), where 
S k n S and S k n S' are disjoint] 

(2.24) £ p(S k ) = £ n S) + £ /x(.S ;c n .S') 

fc=l fc=l fc-1 


> /*"(* n -S) 
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where the last inequality is a consequence of the fact that 
Rn SC uLi ( s k n S) and R n S'C uLi (S fc n 5'). By combining 
(2.23) and (2.24) we obtain 

(2.25) p+(R) + € ^ fi+(R nS) + p+(R n S'), 

which holds for any e > 0. Since /x + (7?) is an outer measure and therefore 
is subadditive, we get 

P+(R) < p+(R nS) + p+(R n S'), 

which in conjunction with (2.25) shows that 

P+(R) = p+(R n S) + p+(R n S'), 

i.e., that S e s# is measurable. 

Thus we have arrived at the result that the Boolean algebra stf is 
contained in the Boolean a algebra + of all /x+-measurable subsets of 2£. 
Since s0 o is the smallest Boolean a algebra containing stf, we must have 
that s$ a C + . Q.E.D. 

Lemma 2.2 tells us, de facto, that the set function /I(jR) defined by 
(2.4) is indeed an extension of /x(S), Ses/, to . From the very 
definition (2.4) of fl(R) it is obvious that /I(7?) is a finite in case that /x(£) 
is a finite. 

In order to see that jl(S) is the only extension of fi(S) when fi(S) is 
a finite, assume that /+(/?), R e , is also an extension of /x(£), i.e., 

P'liS) = p(S) — fi(S), S estf. 

Let Re be a set for which at least one of the two extensions is finite, 
say /I (R) < +oo. Denote by 9ft(7?) the family of all sets in s0 o which 
are subsets of R, i.e., they are of the form R n S, S e stf a , and for which 
/l! and fi are equal, 

fL^R n S) = ji(R n S), RnSe 50i(i?). 

s JJl(7?) is a monotone class, because if S 1 , S 2 ,... e $01(7?) is a monotone 
sequence then, since fl(S n ) is finite, we have 

/I(lim S n ) = lim p,(S n ) = lim fL^S^ = /^(lim S n ). 

n -+ 00 7 Z-> 00 71—>00 7Z->00 

flJl(/?) obviously contains the Boolean algebra (see Exercise 1.8) 
=r= {/? n S: S g .c/}. Since SR(/?) is a monotone class, it must coincide 
with the Boolean a algebra ^ R {R n S: S e generated (see 
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Exercise 1.8) by stf. Obviously R e^ jjR and therefore R e9Jl(i?), which 
means that jx{R) = (R). 

If R 0 e s/ a is a set of infinite jl measure, it can be covered by an 
infinite sequence R 1 , R 2 ,... e s/ a , 


R 0 C\jR k , 

n =1 

of sets R k of finite /I measure because /I(jR 0 ) is assumed a finite. It is 
easy to check that 

°° w _i 

r 0 = u k" (*«- u R *)l 

n =1 L N k =1 

and since S {n) = R 0 n (i? n — Ufcli ^4) is obviously a set from and 
subset of R n , its /I measure is finite. According to the above we have 
then 

= piS'*'), 

and therefore 

&(*») = £ Ai(s (re) ) = £ =/w, 

w=l n=l 

which proves the uniqueness of /Z(i?). 

Theorem 2.3 is of great importance to us because it tells us that all 
the or-finite measures defined on S8q have a unique extension to the 
family 3S n of all Borel sets. This conclusion applies in particular to the 
Lebesgue measure introduced at the beginning of this section for the 
sets in , as well as to any probability measure (which is always finite) 
that has been defined on . 

Due to Theorem 2.3, when studying measures we can restrict our¬ 
selves from now on, without loss of generality, to measures defined on 
Boolean a algebras, i.e., to the study of measure spaces (see Definition 2.2). 

2.4. Cartesian Products of Measure Spaces 

An important way of building a new measure space from two or 
more measure spaces SC x ,..., SC n is by introducing the product measure 
on the direct product 3C X X ••• X SC n of the sets SC 1 ,..., SC n . 

Theorem 2*4* If {9C 1 , , /x x ),..., ( SC n , sf n , /x n ) are n measure 

spaces, there is a measure /x(*S) defined on the Boolean a algebra .$/ of 
subsets of the set SC x ••• X SC n , such that ,c/ is generated by 
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the family s# 1 X • • • X s# n of subsets of and has the property that 
for any S x e S n e s# n 

(2.26) x - x S n ) = rtSJ - /*(S W ); 

this measure is unique if the measure spaces {2 £ 1 , , /xj),..., (^ n , , /x n ) 

are a finite. 

The above measure /x is called the product of the measures /x x /x n , 
and the measure space ( SC , j/, /x) is called the Cartesian product of the 
measure spaces (2£ 1 , , /x^,..., (^ n , , /x n ). 

In order to prove the above theorem, we establish first the following 
lemma. 

*Lcmma 23 ♦ If are Boolean algebras of subsets of the 

sets SC X 9£ n , respectively, then the family ,..., of finite 

unions of sets belonging to s# x X ••• X <stf n is a Boolean algebra of 
subsets of SC = ^ X ••• X , and is identical with the 

family of disjoint unions of sets from X • • • X s$ n . 

Proof. It is obvious that ,..., srff) is closed under the opera¬ 
tions of taking unions of sets from ,..., srff). Furthermore, if 

S e ,..., j^4), then S can be written in the form 

m 

S = (J (S* X - X S£), S* e A = 1,..., m. 

fc=l 

Since for any R = R 1 X • • • X f?„ 6 X • • • X , we have 

(2.27) R' = 0 U - U ( z i x - x R/ x - x Z n ) 

3—1 Z^{Ry%R\} Z n £{R n .R n } 

we get by application of Lemmas 1.1 and 1.2 


S' = n (sf x - X s%y = u 

fc=l i=l 

where r is some positive integer, and each T l is an intersection of a 
finite number of sets from s$ x X ••• X s$ n . But, since for an inter¬ 
section T of sets X ••• X R 5 n , j = 1,..., from X ••• X we 
have 

(2.28) r = n x - x *j) = (n *?) x - x (n 

j-i \j.~i / M n -i / 
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[i.e., T is also an element of j/ n )], we conclude that 

S' e s/ n ). Thus, is a Boolean algebra. 

Any element 


S = \J s\ S\... 9 S m Es/ 1 x ••• X < 

k=l 

of si n ) can be written as the disjoint union of sets 

Ri = Si - Ut'i S\ 

m-1 

S = S 1 U(S 1 — S 2 ) u - u 5 m - (J S k 

' fc=l ’ 

= S 1 u (S 1 n (S 2 )') u - u |s m n (ft 0 s *)' 

By applying (2.27) and (2.28) on each of the sets R 1 ,-.-, in the above 
union, we arrive by induction at the conclusion that J? 1 ,..., R m are 
disjoint unions of sets from X ••• X si n , i.e., that S can be written 
as a disjoint union of sets from X ••• X <s$ n . Q.E.D. 

The Boolean algebra ,..., J^4) is obviously the Boolean algebra 

generated by X • • • X srf n . It is easy to check that the set function 
defined on a set R e ,..., si n ) 

m 

R = \J R k , R k = R k x x R^e x ••• x , 

fc =1 

i?* n = 0 for i ^ k , 



by the expression 

m 

(2-29) /*(*) = -/*«(*£) 

fc=l 

is a measure. Due to Lemma 2.3, /x(i?) is defined by (2.29) on the entire 
Boolean algebra ,..., si n ). Evidently, /x(i?) is or additive if 

fJ. n (R n ) are ct additive. 

The Boolean or algebra si generated by si x X ••• X si n is obviously 
identical with the Boolean or algebra generated by SP(si x ,..., si n ). Conse¬ 
quently, by Theorem 2.3, there is an extension of /x(i?), R e 
to si, which is unique when fi(R) is or finite. Thus, the proof of Theo¬ 
rem 2.4 is completed. 

Note that we can apply Theorem 2.4 to /x x (R ) — ••• = /z n (B) = ijl { 1 1 ) (B) } 
B e 3S 1 , in order to arrive at a measure /z ( j n) (B), B e 3S n , called the 
Lebesgue measure in .‘3? n (see also Exercise 2.3). For an interval 



2. Measures and Measure Spaces 


79 


I — {a x , b ± ) X ••• X ( a n , b n ) e ^ n , we have according to (2.29) and 
the definition of p\ 1) {B ) 

d n \i) = (*i - - (*. - «.)• 


Exercises 

2*1 ♦ Show that every measure p(S) on a Boolean algebra is a 
monotone set function, i.e., if S C i?, i?, S' e s/, then p(R) ^ /x(S), and a 
subtractive set function, i.e., p{R — S) = p(R) — p(S) whenever SCR, 
R,Se^C. 

2*2* Define a set function p { 8 n) (B) on the Boolean or algebra 38 n of 
all Borel sets in 88 n by writing p { 8 n) (B) = 1 if Be 88 n contains the origin 
of (R n , and p { 8 n) (B) — 0 if B does not contain the origin. Show that this 
set function is a finite measure; we shall call this measure the S measure. 

2 3* Let p(x 1 ,..., x n ) be any nonnegative Riemann integrable func¬ 
tion on [R n . Show that the set function p p (B) on the Boolean algebra 
which is defined for intervals I e J n by the Riemann integral 

A^pCD ^ J* p{ x 1 x n) dxi ••• dx n , 

and for a general element B = U ••• U I k , {I± ,..., I k e J n , l i n I 5 = 0 
if i j ) of 3&1 by 

P-piB) = /^(A) + “' + H'oih) 

is a measure (called a Lebesgue-Stieltjes measure induced by p) if we take 
Pi(0) = 0. Why is Pi(B) necessarily a finite ? 

Note . If p(^!,..., # n ) = 1, the corresponding measure pi(B) = p\ n) (B) 
is called the Lebesgue measure on 38 n . 

2 A. Is it true for p\ 1) (B ) defined by (2.2) that /x (1) (lim fc _ >00 B k ) = 
limfc^oo p { l\B k ) for every monotonically decreasing sequence of Borel 
sets in ? 

2*5* If {SC, s#, p) is a measure space, the set function 
P + {R) = sup{p{S): S C R, S e s/} 

is obviously an extended real-valued function defined for every subset 
R e 3% of SC\ this set function is called the inner measure induced by p. 
Show that if R 1 , R 2 ,..., e 3% is any disjoint sequence of sets, then 
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*3* Measurable and Integrable Functions 

3.1. The Concept of a Measurable Function 

In the present section our main purpose is to define and study the 
integral 

jmMO 

of a function/(f), f e SC, defined on a measure space* SC = ( SC, (i). 

To do this we need the concept of a measurable function. 

Definition 3*1 ♦ A real-valued function /(f), fe#*, defined on a 
measurable subset R of the measurable space SC = ( SC y s/) is said to 
be measurable on R if for every open interval IC U 1 the subset 

(3.1) = 

of SC is measurable, i.e., it belongs to $0. If /(f) is an extended real¬ 
valued function, i.e., if it can also assume the values ±°°» then/(f) is 
said to be measurable on R if in addition to the sets (3.1) the sets 

/-H + oo)={f: fetf,/(f)= + oo}, 
f e *,/(*) = -oo} 

are also measurable. An extended complex-valued function A(f) is 
measurable if its real part Re /i(f) and imaginary part Im /i(f) are 
measurable. 

In case that the function/(f), f e IR n , is measurable in (IR n , S8 n \ i.e., 
with respect to the family of all Borel sets, then/(f) is said to be Borel 
measurable ; if /(f) is measurable with respect to the larger class of 
Lebesgue measurable sets—i.e., with respect to the sets SCR n which are 
measurable (see Definition 2.4) with respect to the Lebesgue outer 
measure /z| (n) (i?), R e S ren , defined by (2.5) from the measure fi\ n) (B) y 
BeS$% y 

= I dx x ••• dx n (Riemann integration on B) 

—then/(f) is said to be Lebesgue measurable. Note that the class of 
all Lebesgue measurable sets is, according to Theorem 2.3, a Boolean 

# According to the agreement we have followed until now, we denote the measure space 
(&, im )—consisting of the set SC on which a Boolean a algebra si of subsets of SC and a 
measure /x(JR), JR e si, are defined—also by the letter SC, 



3. Measurable and Integrable Functions 


81 


or algebra which contains* the family of all Borel sets in R n ; therefore, 
every Borel measurable function is also Lebesgue measurable. 

The characteristic function Xs(£) of any subset S of a set SC is defined 

<“> ‘z m. 

As a simple example of a measurable function we can take the charac¬ 
teristic function Xr(€) °f a measurable set R. 

A slightly more complicated example is provided by a simple function 
on a measurable space SC. 

Definition 3*2* A function /(£) on a measurable space is called 
simple if it is of the form 

/(£) = E a *XsM), Si n S } - = 0 for i ^ j, 

fc=l 

where a x a n are, in general, complex numbers and Xsf£) * s the 
characteristic function (3.2) of the set S k . 

As an example of Borel measurable functions on U n we have the 
continuous functions. Namely, if f(x) is a real continuous function 
defined on an open subset of U n and I is an open interval in U 1 , then 
/ -1 (7) is an open set in (R n ; therefore f~\I) is a Borel set in U n by 
Theorem 1.4. Since the real and imaginary parts of a continuous complex 
function defined on an open subset of (R n are also continuous, we can 
state the following. 

Theorem 3*1 ♦ Every continuous complex function defined on an 
open subset R of (R n is Borel measurable on R. 

3.2. Properties of Measurable Functions 

The measurable functions are important to us only to the extent that 
an integrable function has to be first of all measurable. In the theory of 
integration we shall need certain general properties of measurable 
functions, which we state in Theorems 3.2-3.5. 

Theorem 3*2* If the extended real-valued function /(£) defined 
on the measurable subset R of the measurable space SC — ( SC , sC) is 
measurable, and if B is a Borel set on the real line, then / -1 (i?) is a 
measurable set in SC . 

# It can be proved [Halmos, 1950, §13] that every Lebesgue measurable set is of the 
form B A S t where B e 96 n and S is a subset of a Borel set B x , with B x of zero Lebesgue 
measure, i.c., ™ 0. 
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Proof. Denote by the family of all subsets S of R 1 whose inverse 
images f~\S) in X are measurable sets. We will show that ^ is a 
Boolean or algebra. 

Since we can write for any , S 2 ,... e IF 

/- 1 (0 = u /- w . 

\fc=l / k=l 

we conclude that Ufc=i Sjc e ^• Furthermore, for any subset S of R 1 we 
have 

f~\S') = [/"W. S' = R 1 — S, [/"W = X - f~\S ). 

Consequently, if Se^, then we see that / _1 (£") e j/, i.e., / _1 (£") is 
measurable, and therefore S' e 

Thus, ^ is a Boolean or algebra which contains the family of all open 
intervals in R 1 . Every closed interval I c in R 1 has as a complement either 
an open interval (if I c is infinite) or the union of two open intervals (if I c is 
finite) and therefore it has to belong to the Boolean or algebra . Since 
any other element of J> x can be written as the difference of two closed 
intervals (e.g., [a, b) — [a, b] — {6}), we conclude that J n C X . Conse¬ 
quently, must contain the Boolean or algebra SIP generated by X 1 . 

Q.E.D. 

Theorem 3*3* Let /(£), £ e R, be a real measurable function on 
the measurable set R C X, and let g(x ), x e R 1 , be a Borel measurable 
real function defined on a subset of the real line, which contains the 
range of/(£). Then the composite function h(£) — #[/(£)] is a measurable 
function. 

Proof. Since g(x) is Borel measurable, for every open interval I e X 1 
the set £ -1 (/) is a Borel set. Consequently/~ 1 [^ r “ 1 (/)] is measurable in X 
by Theorem 3.2, because/(£) is measurable, i.e., 

h~\I) = f-'[g~\I)] 


is measurable. Q.E.D. 

Note that it follows immediately from the above theorem that for 
any real number a the functions af(£), a + /(£) and |/(£)| a are mea¬ 
surable whenever /(£) is measurable; we get this result by taking g(x), 
x e R 1 , to be equal to, respectively, the continuous (except at x = 0 in 
the case of | x | a , a < 0) and therefore Borel measurable functions ax, 
a + x, and | x \ a . 

In order to prove Theorem 3.4 we need Lemmas 3.1 and 3.2 (see also 
Exercise 3.1). 
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Lemma 3*1 ♦ If /(£) and g(£), £ e R, are real-valued measurable 
functions on the measurable set R, then the set 


is a measurable set. 

Proof. Since the set 91 of all rational numbers is countable, we can 
write it in the form of a sequence 

91 = fa , ^ ,-}• 


For each r k e the sets 

/"%* , +oo)) = {£:/(£) > r fc }, 

^“ 1 ((-° 0 » '*)) = {£ £(£) < '*} 

are measurable on 8£> because they are inverse images of open intervals. 
It is easy to see that 

(3.3) «:/(f) > m = 0 (tf :/(0 > " (f: *(£) < rj) 

1 

by noting that whenever for some £ e R we have /(£) > ^(£), there 
must be a rational number r for which /(£) > r > £(£). Since 
{£ : /(£) > r fc} n {£ : #(£) < r k} as ^e intersection of two measurable 
sets is also measurable, we conclude from (3.3) that {£:/(£) > *(£)} is 
measurable. Q.E.D; 

Lemma 3*2* A real-valued function h(£), £ e 9E y on the measurable 
set R is measurable if for any real number c 

h~\{~ oo, 0) = {f: *(P < 
is a measurable subset of 

Proof. For any finite open interval (a , 6) we have 

(3.4) h~\(a, b)) = {(: a < h(Q < b} 

= {(: h(0 <b}~ {(: h(() < a}. 

Since we can write {£: h(£) ^ a} as an intersection of a countable number 
of sets which are measurable according to the assumption 

{€•' m < a) = n A(0 < « + l/»}, 

«~1 
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we conclude that {£: h(0 < a) is measurable (see Exercise 1.9), i.e., 
both sets appearing on the right-hand side of (3.4) are measurable, and 
therefore h~\(a t b)) is measurable. Q.E.D. 

Theorem 3*4* If gi(x) t g 2 (x) y ... is a sequence of extended complex¬ 
valued functions which are measurable on the measurable set RC3£, 
and if 

g( x ) = J"££•(*) 

exists, then g(x) is an extended complex-valued function which is 
measurable on R. 

Proof. It is easy to check with the help of Lemma 3.1 (see Exer¬ 
cise 3.3) that the functions 

h n {x) = sup Re g k (x) 

k=n,n+ 1,... 

are measurable on R. [Im g k (x) are treated similarly]. Since we have 
Re g(x) — lim Re g n (x) — lim h n (x) — inf h n (x) 

n-»oo n->oo n=1,2,... 

because h x {x) > h 2 (x) > ••• , and as inf n=1>2> h n (x) is measurable if 
Ai(^), h 2 (x),... are measurable functions (see Exercise 3.3), we conclude 
that £(#) is measurable on R. Q.E.D. 

Theorem 3.5. If f(£) and g(£), £ e R> are extended real-valued 
measurable functions defined on the measurable set R, and if 

(3.5) {f(£) = ±oo} n {g(£) = ±oo} = 0, 

then the functions f(£) + #(£) and/(£)#(£) are also measurable. 

Proof. Due to the requirement (3.5), the functions f(£) + g(£) and 
f(£) g(£) are defined everywhere. The sets 

{m +g(o - - {m - ^ {g(o - 

img(£) - ±oo} 

= mo = n {g(0 > 0}] u [{/(£) > 0} n {g(0 = ± 00 }] 

obviously are measurable. Therefore, we can concentrate on the set of 
points at which f{£) and g(£) are finite. 

For any real number c we can write 

(3.6) {£: oo </(*)+*(*) <c} 

= {0 -oo <m < c -g(€)} n {(: g(£) > -oo} 
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and since c — g(£) is measurable, we conclude from Lemma 3.1 that 
the set on the left-hand side of (3.6) is measurable. Therefore,/(£) + g(£) 
is measurable by Lemma 3.2. 

The measurability of f(£)g(£) follows immediately from the relation 

mg(o = a\m +^(ai 2 - 1/(0 -*m 

which is true whenever /(£) ^ ± 00 , g(£) =£ ±oo. Q.E.D. 

3.3. Positive-Definite Integrable Functions 

In discussing the concept of measurable functions it was sufficient to 
consider only measurable spaces SC — (SC, j/). For the study of the 
general theory of integration we need functions /(£), £ e SC, defined on 
measure spaces SC = (SC, sf, p). In order to introduce the general concept 
of integral of such functions we have to start gradually by defining first 
integration for the simple functions introduced in Definition 3.2. 

Definition 33♦ We say that the simple function (see Definition 3.2) 

s (0 = Z a kXs t (£), 

k=l 

defined on a measurable set R of the measure space SC = (SC, stf, fi) is 
integrable on R if 

lx(R n {£: s(0 ^ 0}) < oo. 

Its integral with respect to /x is then defined* to be 

(3.7) f *(f) 4*(f) = £ a^(S k nR). 

J R k =l 

Next we define the concept of an integral on SC for nonnegative 
functions: 

Definition 3*4* An extended real-valued nonnegative function /(£) 
defined on a measurable subset R of a measure space SC = (SC, stf, fi) is 
said to be integrable on R if /(£) is measurable on R, and if there is a 
nondecreasing sequence 

(3.8) *i(0<*.(fl<- 

# In the case that /*(/£ n {£ : s(() ~~ 0}) ~ -| - go, the sum appearing in (3.7) is still 
well defined, because we have agreed in §2 to write 0 • (-|- oo) ---• 0. 
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of simple functions, each integrable on R y which converges at each 
point £eR to/(£), 

(3.9) /(f)=lim,„(f), 

oo 

and is such that lim n ^ 00 Js n (f) dfi(£) exists and is finite. The integral of 
/(f) on R is defined as 

(3.10) f /(f)<Wf) = lim f s n (£)d^). 

J R n^o o J R 

Note that the above definition by which/(f) is the limit of simple 
integrable functions implies that the measure of R n (f:/(f) > 0} is 
or finite, because 

R n (f:/(f) > 0} = Q [R n {f: ,*(f) > 0}], 

fe=l 

and since s fc (f) is integrable, /x({f: s fc (f) > 0}) < -fax 

Conversely, if S 0 = R n (f:/(f) >0} is or finite, we can write 
S 0 = Un=i Rn > where jR x , jR 2 >••• is a monotonically increasing sequence 
of sets of finite measure. The sequence s n (f), n — 1,2,..., of simple 
functions defined by 

1+712"/ _ 1 

*«(£) = £ ~yr~ 

k =1 z 

s 0 -R n , 

R n n jf: ^ < ^r|’ * = 2...., «2*. 

«.n{f:/(f)^») 

is a sequence of integrable simple functions which satisfies the require¬ 
ments (3.8) and (3.9). Thus, if /x( R ) is a finite, we can always define the 
limit appearing in (3.10). 

Now we have to establish that if the limit (3.10) exists, then it is 
independent of the choice of the sequence of simple functions. For that 
we need the following lemma. 

Lemma 3*3* If 0 ^ (0 ^ ^ •** , f e R, is a nondecreasing 

sequence of simple functions integrable on the measurable set R in the 
measure space SC = (SC , j/, /x), and if s(f), f e R, is also a simple function 
integrable on R , and such that 


(3.11) 


= 

S? = 


o(w) _ 

°1+7I2" — 
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then 

(3.13) f if s n (S)driS)- 

J R n^oo J R 

Proof. Since s(£) is integrable on R, the set 

R 0 = {£: stf) > 0} 

is a measurable subset of R , and we obviously have by (3.7) 

f Ki) = f s(i) d(. 

J R J R 0 

Assume that /x(i? 0 ) > 0—because if /x(i? 0 ) = 0, (3.13) would already be 
true. 

For each n = 1, 2,... the set 

is fi measurable (see Lemma 3.1 and Exercise 3.1). Since s^) ^ 
s 2 (£) ^ ••• , we have S x C S 2 C ••• ; due to (3.12) we can deduce that 
lim n ^ 00 S n = R 0 . By the continuity from below of (jl(S) (Theorem 2.1), 
we get 

(3.14) limtiSn) =KRo)- 

n- 4 oo 

The simple function s(£) is necessarily, bounded: s(£) ^ M. Thus, by 
taking into consideration the rules for dealing with integrals of simple 
functions (see Exercises 3.5 and 3.6), and since s n(£) dp(£) ^ 0 
because s n (£) > 0, we get 

(3.15) f s n (€)drii) > f s n (i)drii) 

J R J S n 

= f <0 MO - f <0 MO - -tL: Ks n ). 

J R 0 J R 0 -s n l x { K n) 

When n —► oo, we find from (3.14) that the second term on the right- 
hand side of (3,15) vanishes because 

f ^M^(R 0 -S n ) 9 

J *o--Sn 
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while the third term converges to e. Thus we have arrived at the conclu¬ 
sion that 

lim f s n (£) dp(g) > [ s(f) dp,(£) — e 
n ~*°° J r Jr 

is satisfied for every e > 0; therefore, (3.13) holds. Q.E.D. 

Theorem 3*6* If s /£), s 2 ( £)>••• and s i(£)> are two non “ 

decreasing sequences of nonnegative simple functions converging at 
each £ e R to/(£) ^ 0, then the limits lim^^ or n and lim^^ cr n ' of the 
integrals 

== f *n(0 dfl(^) 9 

J R 

<*n = f s n{£) dp{£) 

J R 

are equal in case that at least one of the sequences a 1 , cr 2 >••• and a i, 
converges. 

Proof. Assume that g 1 , cr 2 >••• converges. For any n we have 
s n(0 </(£) = ^ m fc->oo Sk(0> and therefore by Lemma 3.3 

u n ' ^ lim <r fc . 

fc-400 

It is easy to establish from (3.7) that or/, of is a monotonic non¬ 
decreasing sequence, which consequently converges because it is 
bounded. We have 

lim of ^ lim o k . 

71-400 &-400 

By noting that again according to Lemma 3.3 

cj k < lim of 


for any k = 1, 2 ,..., we conclude that 


lim o k ^ lim of. 

fe-400 77-4 00 


Hence lim^ a k = lim^ a n ’. Q.E.D. 

Lemma 3.4, If /(f), f e R, is an extended real-valued measurable 
function, then the functions 

/ + (f) = max{/(f), 0}, 

/"(f) = / + (f) ~/(f) = max{—/(£), 0} 


are also measurable. 



3. Measurable and Integrable Functions 


89 


The above lemma follows immediately from Theorem 3.3 if we note 
that the function g(x) — max{x, 0} is a continuous function on M 1 . 

3.4. Real and Complex Integrable Functions 

Definition 3*5* The extended real-valued measurable function /(£) 
defined on a measurable subset R of a measure space SC — (SC, stf, fi) is 
fji integrable on R if its positive part /+(£) and its negative part/“(£), 
defined by (3.16), are /x integrable on R . Then its integral is defined to 
be 

(3.17) f /(0<Wfl = f f 

J R J R J R 

An extended complex-valued measurable function h(£), £ e R, is inte¬ 
grable on R if Re h(£) and Im h(£) are integrable on R, and its integral 
is taken to be 

(3.18) f A(f)4t(f)=f ReA(0^(0+if Im A(f) d^i). 

J R J R J R 

We have thus arrived at a very general concept of an integral. When 
we consider integration on the real line, i.e., SC = IR 1 , and we take 
R = [ a , 6], we obtain a generalization of the Riemann integral 

C b 

f(x) dx 

J a 

of the function f(x) if we take the measure space (IR 1 , SS 1 , /x^), with the 
Lebesgue measure /x ( / 1) ( B ), BeSS 1 . For instance, assume that/(#) ^ 0 
is continuous in [a, b]. Then by Theorem 3.1 f(x) is measurable 
[note that /x ( I 1) ({a}) = ^({b}) = 0]. If we build for a partition 
a = x 0 < x x < • • • < x n = b of [a, b] the Riemann sum 

n 

O n = Z <**(** - **-l)> a * = inf {/(*): X k~l < * < 

we note that or n corresponds to the integral (3.7), 

O n = I *„(*) (*), 

J [a,&] 

of the simple function 

n 

S n (x) = ^kXsJ< x )y ^k ~ { ,x '* x k —1 ^ x < '* 
fc~l 
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If we take a sequence of finer and finer partitions of [ a , b ] in which the 
(n + l)st partition is a subpartition of the nth one, so that 
max fc=1 n | x k — x k+1 | —> 0 when n—> oo, then obviously the non¬ 
decreasing sequence ^(tf) < s 2 (x) < • •• converges to f(x), and by 
Definition 3.4 

(3.19) f f(x) d l i ( 1 °{x) = lim <j n . 

J la, i>] 

On the other hand, from the definition of the Riemann integral we also 
have 

fix) dx = lim o n , 

J a n->co 

proving the equality of these two integrals. 

The integral 

f /(*) d^\x) 

J R 

of a function on U 1 is called the Lebesgue integral. It is ordinarily 
denoted by 

f {x) dx , 

J R 

like the Riemann integral, to which it is equal in the case that the 
Riemann integral exists. However, as we can see in simple examples 
(see Exercise 3.4), the Lebesgue integral exists also in cases in which 
the Riemann integral does not exist. In fact, the following theorem (first 
proved by Lebesgue, whose proof we do not reproduce*) is true. 

Theorem 3*7* A bounded function defined on [a y b] CIR 1 is 
Riemann integrable on [a y b] if and only if it is continuous almost 
everywhere on [a, b]. In that case the Lebesgu^ integral of f(x) on [a y b ] 
exists and is equal to the Riemann integral of j(x) on [a y b]. 

In Theorem 3.7 the expression “almost everywhere” appears. This 
expression is very frequently used in the theory of measure. 

Definition 3*6* A statement concerning every element £ e S of a 
measurable set S in measure space SE is said to be true almost everywhere 
in S y if it is true on a set R C S whose complement S — R with respect 
to S is of measure zero, i.e., /x( R) — /x(S). 

The fact that sets of measure zero are not essential in computing 
integrals can be observed in the following theorem. 


* See Munroc [ 1953, Theorems 24.4, 24.5]. 
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Theorem 3*8* If h(£)> £ e R y is any measurable* function on the 
set R of measure zero, then h(£) is integrable on R and 

(3.20) f h(t)d^(t) = 0. 

J R 

Consequently, if/(£) — g(£) almost everywhere on a measurable set S y 
and/(£) is integrable on S y then g(£) is integrable on S and 

(3.21) f mdp(€)= f g{€) drift 

J s J s 

Proof. In case that h(£) > 0, (3.20) can be very easily proved by 
resorting to sequences (3.8) of simple functions, and noting that in 
our case (3.7) is always zero. By means of Definition 3.5 this result can 
be immediately generalized to any real, and consequently any complex 
measurable function. The second part of the theorem and (3.21) follow 
by taking A(f) = /(f) - g(£). Q.E.D. 

Theorem 3*9* If f(£) and g(£) are measurable functions defined 
on the measurable set R y if |/(f)| ^g(£) almost everywhere on R y and 
if g(£) is integrable on R , then/(£) is also integrable on R y and 

I f mdrio |< f f Adrift 

Proof. From Definition 3.5 of the integral for real and complex func¬ 
tions it is obvious that the theorem is true in general if it is true for non- 
negative functions/(f), since (Re/(f))± < |/(f)|, (Im/(f))± < |/(f)|. 

Consider therefore the case when/(£) ^ 0. Since |/(£)| ^g(£) on a 
set R 0 C R , where /x(i? — R 0 ) = 0, we have 

[R 0 n {(:/(() > 0}] C [R 0 n {£: g({) > 0}]. 

Therefore, as *n{f:*(f) > 0} has a a-finite measure because g(£) is 
integrable on R y the set R n {£:/(£) > 0} has to be also of or-finite 
measure. Consequently, the sequence ^(^) ^ s 2 (£) ^ ••• of simple func¬ 
tions (3.11) which are integrable on R and for which lim^oo s n (£) = /(£), 
can be built. 


* In measure theory the measure spaces are usually extended to so-called complete 
measure spaces, having the property that any subset of a set of measure zero is also 
measurable and of measure zero. This device makes any function h(g), £ e R, auto¬ 
matically measurable on the set R when fi(R) -- 0. 
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By Lemma 3.3, given any nondecreasing sequence $/(£), of 

simple functions converging to g(£) for every £ e R> we have 

°n= [ s n(€) < lim f s k (ij) dfx(i) — f g{£) dfi(i) 

J R k-^+CC J R J R 

for every n = 1, 2,... . Consequently, the sequence cr 1 < cr 2 < ••• is 
bounded from above and therefore it has a finite limit, which by defini¬ 
tion is 

lima n = f mdfi(i). Q.E.D. 

n-+°o J R 

3.5. Infinite Sequences and Sums of Integrals 

^Theorem 3*10 (Lebesgue monotone convergence theorem). Let 
/i(£),/ 2 (£),... be a monotone sequence of nonnegative functions, each 
integrable on the measurable set R. The extended real-valued function 

(3.22) /(f) = lim/ tt (f), (eR, 
is integrable on R and 

(3.23) f mMS) = lirn f U£)dri$), 

J R n-> oo J R 

provided that in case of a monotonically increasing sequence the limit 
in (3.23) is finite. 

Proof. Consider first the case when /i(£) <T/ 2 (£) ^ ••• . For each 
n ~ 1, 2 ,..., let s nl (£), s n2 (£) y ... be a nondecreasing sequence of simple 
functions for which 

bn* s nk (£) = f n (£). 


and consequently 

lim f s nk (£)d^) = f f n (i)d i u(i). 

«->oc j R Jr 

For each positive integer k the function 

^nk(£) ~ m £^ ax n ^wfc(^) 

is evidently a simple integrable function. From the above definition 
and the fact that (i) we get 

s 'k.k(i) ^ s 'kk.i-i(0 4flfc+l(£)> 
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i- e -> 4i(£) < 4s(£) < • Since s' kk (£) </ fc (f)> we have for each n 

/«(f) 4 lim s' kk (£) 4 /(f), n = 1, 2,... . 

By using (3.22) we deduce from the above inequality that 

(3-24) lim4 ft (f) = /(f). 

«-> 00 

Since 4*(£) 4 /4f), we can infer that 

(3-25) hm44f) 4 jjm/*(£) < +oo. 

The function /(f) is measurable over 7? because /(f) = sup^^/^f) 
(see Exercise 3.3). Furthermore, (3.24) and (3.25) show that /(f) is 
integrable over R and that 

(3.26) f /(f) 4*(f) = lim f 4 ft (f) d^( f) < lim f / B (f) 44f). 

«/ j(f ft > CO J £ 71 - >00 J u 

On the other hand, since/„(f) 4/(f), we have according to Theorem 3.9 

(3.27) f / re (f) 4*(f) < f /(f) 44 f), « = 1, 2,.... 

■'j x 

The inequalities (3.26) and (3.27) yield (3.23). 

In case that/^f) >/ 2 (f) > ••• , the sequence ^(f), g 2 (f),..., 

g»(£)=M£)-/»(£), n= 1,2,..., 

is nondecreasing, and according to the above, its limit 

g(£) = 1™ ^(0 = /i(f) - lim/„(f) 

n->oo 7i—>oo 

is integrable over i?: 

J = Bmf gn(i)M€)< f /i(f)^Wf). 

J1? «^°° J If 

Consequently, the function 

lim/ w (f)^/i(0-^) 

7Z->00 

is integrable over R, and since 

f lim/n(f)^(f) = f /,(f) 440 - f j-(f)44f), 
we see that (3.23) is again true. Q.E.D. 
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^Theorem 3*11* Let R x , R 2 ,... be an infinite sequence of disjoint 
measurable sets, and let 


00 

R= U R n 

n=l 

be their union. If the extended real-valued function /(f) is integrable 
on each set R n , then /(f) is integrable on R if and only if 

£ f 1/(01 Mi) < +oo, 

n—1 Rn 


and in that case 


f f{i)Mi)= £ f f(i)Mi)- 

J R n=l J 

Proof. In case that/(£) > 0, we can introduce the functions 

for f e R 1 u ••• u R n 

n 

for £ e R — U R k , 

Tc=1 



which are obviously integrable on R. The sequence /^f) </ 2 (f) ^ ••• 
has/(f) as a limit, and satisfies the conditions of Theorem 3.10. Thus 
we have shown that/(f) is integrable on R if and only if 

lim £ f f(i)Mi) = lim f Mi)Mi)= f f(i)Mi), 

w ^°° A;=l J R k W ^°° J R J R 

which proves the theorem for the case when/(f) ^ 0. 

The more general case when/(f) is not nonnegative everywhere in R 
can be reduced to the already considered case by treating/+(f) and/~(f) 
separately. Q.E.D. 


3.6. Integration on Cartesian Products of 
Measure Spaces 

We shall study now the problem of integration on Cartesian products 
of measure spaces in relation to integration on each of the spaces entering 
in such a Cartesian product. In order to simplify the notation, we shall 
limit ourselves to the statement and proof of results for products of 
two measure spaces. These results can very easily be generalized to the 
case of products of any finite number of measure spaces. 



3. Measurable and Integrable Functions 


95 


^Theorem 3*12* If the extended real-valued function /(ft 77 ), 
£ e SC, rj is measurable on the Cartesian product {SC X Stf, s/) of 
the measurable spaces SC = {SC, stff) and then the functions 

/„(£)=/(f.ij), fear. 

My) = f(£,y), ye®, 

are measurable functions on SC and ( W, respectively. 

Proof. We shall give only the proof for /„(£) since the proof for f € { 7 ]) 
proceeds along identical lines. 

Let / e/ 1 be a one-dimensional interval. We have for fixed 7 ] e <3/ 
(3.28) fC\I) = {tmel} = {ft/(ft * 7 ) e/} 

= {f: (ft = 

we have introduced above the notation in which, for any subset S of 
SC X we denote by S,, the subset of SC containing the points £ for 
which (ft 7]) belongs to S, i.e., 

S v = {ft (ft ii) 6 5}, 5Cf xf. 

We shall prove now that if S C SC X ^ is measurable in SC X 
then for any given 77 e ^ the set £„ C SC is measurable in $T. 

If S' C SC X ^ is of the form S 1 X S ' 2 and if S is measurable, i.e., 
if S e j/, then by definition S 1 e ^ and S ' 2 e • Since in this case 

if ,eS* 

" (0, if 

and 0 ej/i, it follows that S,, is measurable in 5f. 

Denote by the family of all subsets S of SC X SM which are such 
that S v e j / 1 for all 77 e SU. It is easy to verify that §* is a Boolean cr 
algebra. Since, according to the above, contains X > while by 
definition is the smallest Boolean cr algebra containing X , it 
follows that stf C 

Thus, we have proved that S v e for all 77 e whenever S e jaft 
In particular we can deduce from (3.28) that 

for all I g since/(ft 77 ) is measurable, and consequently f~\I) e 

Q.E.D. 
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^Theorem 3 A3 (Fubini’s theorem). Let (£T, , ju, x ) and (2/, ^2 > fo) 

be cr-finite measure spaces and ( 9C X j/, /x) the Cartesian product 
of these spaces. If the function /(£, 77 ) is integrable on the measurable 
set S e j/, then the functions 

g(£) = f m V) <Mn), = {{': (f, V) e S) 

J Sf 

(3.29) 

Kv) = f m V) dp i(0, , 6 S* = (f, V) e S} 

s v 

are almost everywhere defined and integrable on S ' 1 C and S 2 C <2/ 
respectively, and 

(3.30) \ m-n)dM,- n )=\ g{£)dp x (f) = f ^)f m*t)dM 

J S J s 1 J S 1 J Sg 

= f h (v) djJ-Av) = f d^Av) f / (f, v) dft 1 (i). 

J S 2 j j ^T, 

Proof. We shall first prove the theorem for the case when/(£, 7 ]) is 
the characteristic function xs(£> v) °f a measurable set S e j/. 

Consider the case when S — S ' 1 X S' 2 , S 1 g ^ , S ' 2 e • We have 

*{0 = xsAi) m. (S*), h(ri) = Xs Av) nAS 1 ), 

and consequently 


g(i) d/j-AQ = ih( s1 ) i j -A s2 ) = f K 7 )) d/j-Av)- 

s 1 J s 2 


Since 


f Xs(£, 1?) « drj = /X(S) = ^(S 1 X S 2 ) = /x^) ,x 2 (S 2 ), 

J s 

the relation (3.30) is verified for the case where Sgj^ X rf 2 * 

Denote by JT the class of sets S e si which are such that for their 
characteristic functions Xs(£> v) the theorem is true. It can be inferred 
directly from Theorem 3.10 that JT is a monotone class. We have seen 
that each S e s$ x X ^ belongs to Furthermore, we can extend this 
result to any finite (or even countably infinite) disjoint union of sets 
from X s &2 by using Theorem 3.10. Since the family of all finite 
disjoint unions of sets from X ^2 * s identical to the Boolean algebra 
generated by X (see Lemma 2.3), which is therefore contained 
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in JT, we arrive at the desired result that $0 C JT by employing 
Theorem 1.7. 

Thus, we have proved that the theorem is true whenever /(£, rj) is 
the characteristic function of a measurable set; this result is correct 
even where the integration is carried out over a set S which is different 
from the set R whose characteristic function is being integrated, as can 
be easily derived from the relation 

f xr(£) d i4£) = f xnnsit) d K0- 

J s J Rns 

From this statement we can infer immediately that the theorem is true 
for any simple function which is integrable on S e s/. 

Now 7 let/(|, 7 ]) be a nonnegative function integrable on S e j/. Choose 
a sequence /x(£, ?]) ^ /«(£, v) ••• of simple functions integrable on R> 
for which almost everywhere 

tj) =/(£, y), 

n~* oo 

and consequently 

(3.31) lim f v ) d t ,(e ,,) = (' m ,) d^e, v). 

n ~> 00 J s J S 

The sequence of functions g^), g 2 (£)>---> 

gnd) = f fn{L v) dlL 2 (r)\ 

is nondecreasing, and by Theorem 3.10 we have, almost everywhere, 
lim gn(i) = f /(£ v) d ^(v) = g(£)- 

n~* 00 J ^ 

A second application of Theorem 3.10 yields the facts that g(£) is 
integrable on S 1 and that 

(3.32) f = f g n (i)d^). 

J sl n-* oo J 5l 

Since / n (£, r/), n = 1, 2,..., are simple functions for which the theorem 
has been already established, we have 

f = f /»(£>»?) »?); 

•' s' J s 


(3.33) 
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hence, by combining (3.31)—(3.33) we get 
J sl *-*00 J s 

The equivalent result for h(£) can be derived by the same procedure. 
With the theorem established for nonnegative functions, it is trivial 
to extend it to real and then complex functions by using Definition 3.5. 

Q.E.D. 

It is essential to realize that when we are presented with an iterated 
integral 

(3.34) f 

J s x J s 2 

we cannot apply immediately Fubini’s theorem to invert the order of 
integration in £ and 77 . We could justify such a procedure only if we 
knew not only that ( 3 . 34 ) exists, but also that/(£, 77 ) is integrable on 
S 1 X S 2 in the measure /x x X /x 2 . The following theorem frequently 
provides help in such situations. 

^Theorem 3*14 (ToneZ/fs theorem ). Let (^, and (^, j/ 2 > ^ 2 ) 

be a-finite measure spaces and ( 2 £ X j/, /x) their Cartesian product. 
Suppose the function /(£, 77 ) on S 1 X S 2 C SC X is measurable on 
S 1 X S 2 , that |/(£, 77)1 is integrable on S 2 for all £ e S 1 , with the 
possible exception of a set R 0 C S 1 with /x 1 (i? 0 ) = 0, and that the iterated 
integral 

(3.35) f d^)\ dfM 2 (rj) |/(£ 77)1 

J S^Ro J S 2 

exists. Then the integral 

(3.36) f f((,ri)Mi.v)= \ 2 (v) 

J S1XS2 J S1XS2 

also exists. 

Proof . Let g(£, 77 ) be a function which is positive and integrable on 
S 1 X S 2 ; for example, if S (1) , S (2) ,... are sets of finite /x measure which 
cover* S 1 X S 2 , i.e., S (1) U S {2) U ••• == S 1 X S 2 , then the function 

g(t v) = Z (1 In 2 ) b^S^XsMd, V) 

n 

provides an adequate choice if fi(S in) ) > 0 for all n . 

* Recall that X and therefore also S x X S 2 , is of cr-finite fi measure, and therefore 
such a covering exists. 
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Let us introduce the functions 

Mi, v) = mf{l f(i, 1 J)I, Mi, r,)}, n = 1 , 2,.... 

The function f n (£, 77) is measurable, since it is equal to the measurable 
function ng(£, rj) on the measurable set (see Lemma 3.1) 

{(£ v) :n g(Lv) </(£*?)} 

and equal to /(£, rj) on the complement of that set. Moreover, by 
Theorem 3.9 f n (g, 77 ) is integrable in S 1 X S 2 , since |/ n (£, 77)1 < ng (£, rj) 
and g(£, rj) is integrable on X S 2 . Hence, by Fubini’s theorem, the 
function f n (£> rj) is integrable in 77 over S 2 , with the possible exception 
of a set R n C S 1 with /x 1 (i^ n ) = 0. 

Let us introduce the set R 0 — R 1 U R 2 U ••• of measure zero. 
All functions / n (£, rj) are integrable on (S 1 — R 0 ) X S 2 . Since 
fn(L V) ^ |/(£> 7 ?)l> we obtain by applying Fubini’s theorem and 
Theorem 3.9 

f Mi, v) d Ki, 1 ?)=f d Mi) f Mi, v) d Mv) 

J J Sx-Rq j S 2 

d Mi) f \f(i, ■'))! 

J s 1 ~R„ J s 2 

which shows that the integrals of / n (£, 77) on S x X S 2 are bounded. 
Hence, Theorem 3.10 can be applied to infer that since \f(£, ?])\ — 
lim/ n (£, 77), the function |/(£, 77)1 is integrable on S x X S 2 . Q.E.D. 

If the conditions of the above theorem are fulfilled, then the order of 
integration in (3.34) can be inverted. This conclusion can be reached 
by using Tonelli’s theorem to infer that/(£, 77) is integrable on S x X S 2 , 
and then applying Fubini’s theorem to justify the reversal of the order 
of integration. 

Exercises 

3*1 ♦ By employing Lemma 3.1, show that if the functions /(£) and 
g(£) are measurable, then the sets 


are measurable. 

3*2* By employing the same procedure as in proving Lemma 3.2, 
show that an extended real-valued function is measurable if and only if 



100 


II* Measure Theory and Hilbert Spaces of Functions 


for every real nu nber c the set {f: — oo =<C/(f) ^ c} is measurable. 
Prove that the same statement is true if we consider instead of the sets 
{f: — oo </(f) ^ c}, the sets {f: c ^/(f) < -f 00 }, or the sets 
{£■ c </(£) < +oo}. 

3*3* Prove that if/i(f),/ 2 (f)>- . is a sequence of extended real-valued 
measurable functions on the measurable space SC, then the functions 
sup„ =1>2> .../„(£) and inf w=12i /„(£) are also measurable. 

3.4. Show that for the simple function defined on R = [0, 1] by 

_ (4 for x irrational 

(2 for x rational 

the Lebesgue integral over R exists and 

f /(*)<W*) = 4. 

J R 

3*5* Derive directly from the Definition 3.3 that if ^(f) and s 2 (f) 
are two simple functions integrable on R, then 

f (hit) + s 2 (i)) <*/x(f) = f *i(f) d^O + f * 2 (f) <//x(f), 

J R J R J R 

and that in case s ^f) ^ s 2 (f), f G we have 

f hd)d^)> f *.(f)<Wf)- 

J R J R 

3*6* Show, by using only the Definition 3.3, that if R x and R 2 are 
two disjoint measurable sets and the simple function s(f) is fx integrable 
on R 1 and R 2 , then s(f) is fx integrable on R x U R 2 and 

f «f)<Wf)=f K0M€)+ f <0M0- 

J RiVR 2 J R t J R 2 

3*7* Explain why a real function f(x) is fx integrable on a set R if 
and only if it is absolutely [x integrable on R, i.e., if $ R \f(x)\ dfx(x) exists. 

3 *8* Show that if/(f) and £(f) are fx integrable functions on jR, then 
for any a, b e C 1 , the function af ( f) + bg{ f) is fx integrable in R and 

f mo + wo) mo =«f /(o dtio + h f g(o *tio- 

J R J R J R 
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3*9* Show that if /(£) ^ g(£) on the measurable set R , and if f{£) 
and g(£) are integrable on R y then 

f m Mi) > f Ai) Mi)- 

J R J R 

4* Spaces of Square-Integrable Functions 

4.1. Square-Integrable Functions 

Let Q be a Borel subset of IR n . Denote by SS 7 ^ the family of all Borel 
sets in R n which are subsets of Q . Clearly, SS 7 ^ is a Boolean a algebra 
(see Exercise 1.8). Let /x( B ), B e SS 7 ^ , be a measure on , and denote 
by Q the measure space ( Q , SS 7 ^ , /x). 

Definition 4*1 ♦ We say that an extended complex-valued function 
/(#), x e Q, defined almost everywhere on Q is square integrable whenever 
f(x) is measurable [i.e., \f(x )\ 2 is also measurable by Theorem 3.3] 
and |/(x)| 2 is integrable on Q, i.e., 

f \f(x)\ 2 dfx(x) 

J £2 

exists and is finite. 

If a square-integrable function f(x) is not defined everywhere on .Q, 
we can extend it to Q by writing f(x) = 0 at the points x where it was 
not previously defined. Hence, we shall assume in the sequel that each 
square integrable function on Q is thus suitably defined everywhere in Q . 

A function that is square integrable is not necessarily integrable! 
Witness the case of the function/(#) = x~ 3 / 4 , xeQ — [1, +°°)> for 
integrals in the Lebesgue measure on : 

f \f(x)\ 2 d^\x) = f x~* /2 dx = 2, 

J tl.oo) J 1 

f f(x)dfi[ 1 \x) = f x _3/4 dx = +oo. 

J [I.®) ■J 1 

However, in the case < +oo, a square-integrable function will 
also be integrable. 

Theorem 4*1 ♦ If the measure /x(i?) of a measurable set R is finite, 
then every function/(£) which is square integrable on R is also integrable 
on JR. 
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Theorem 4.1 can be deduced immediately from Theorem 3.9 and 
the inequality 

1/(01 < id + l/d)l 2 ). 

Theorem 4.2. The set L( 2 )(£?, /x) of all complex-valued functions 
which are square integrable on Q is a complex vector space with respect 
to the vector operations 

(4.1) («/)(*) = «/(*), a e C 1 , f(x) eL i2) (Q , /x), 

(4.2) (/ + g)(x) =f(x)+ g(x), /(*), g(x) e L i2) (Q , /x). 

Proof. lff(x) is square integrable on Q then af(x) is obviously also 
square integrable. 

In order to establish that f(x)-\-g(x)eL( 2 )(Q,fji) whenever f(x), 
g(x) eL( 2 )(i2, n) it is sufficient to use the easily verifiable inequality 

I fix) +g(x) I 2 < 2(|/(x )| 2 + |^)| 2 ) 

in conjunction with Theorems 3.5 and 3.9. 

Once it is established that the operations (4.1) and (4.2) leave L( 2 )(^3, /x) 
invariant, it is a trivial task to check that all the axioms for a vector 
space, given in Chapter I, Definition 1.1, are satisfied, if in addition we 
take as the zero vector the function/(#) = 0, x e Q. Q.E.D. 

The family of all spaces L( 2 )(Q, /x) includes as a special case the vector 
space Z 2 (oo) introduced in Chapter I, §4; Z 2 (oo) is obtained when we 
take Q = {1, 2,...} and when the measure /x is defined (uniquely) as the 
or-additive set function for which 

/*({*}) = 1 , n = 1 , 2 ,.... 

We get by applying Theorem 3.11 

f \f(x)\ 2 dn(x) 

= £ f \m\ 2 <//x(*) = z i/(n)i 2 /x(W) = z i/wi 2 - 

n=1 J ( n ) n=1 n=1 

Lemma 4.1. Two extended complex-valued measurable functions 
/(£) and g(£), defined on the measurable set R in a measure space, are 
equal almost everywhere in R if and only if 

f l/(f)-*( 0 l , ‘WO*=o. 

J R 


(4.3) 
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Proof. If the above two functions are equal almost everywhere, 
then 1/(0 — #(£)! = 0 almost everywhere and (4.3) follows from 
Theorem 3.8. 

Conversely, if (4.3) is true, then for any integer n > 0 the set 
> 1 fn} n R has to be of measure zero, because 
since | /(£) — g (£)I > (1 /n) xs (0> we ^ ave as a special case of Theo¬ 
rem 3.9, 

0 < riSn) = f XsJii) drii) < n 2 f \f(ft -g(ft |» d^g) = 0. 

J R J R 

Consequently 

kh- i m -m > o}) = m (0 s «) < t = °- Q- E - D - 

\W=1 1 n =1 

4.2. Hilbert Spaces of Square-Integrable Functions 

It is easy to check (see Exercise 4.1) that the relation of “being almost 
everywhere equal” that can hold between pairs of functions from 
L( 2 )(Q, fi) is an equivalence relation. It is customary to denote the family 
of all equivalence classes of functions which are almost everywhere 
equal and belong to L( 2 )(Q r p) also by the symbol L( 2 )(Q> /x), but in 
order to avoid confusion, we shall denote that family by L 2 (Q, /x). 

We denote the equivalence class which contains the square integrable 
function f(x) by/. Thus, iff(x) and g(x) are almost everywhere equal, 
then / and g are identical equivalence classes, and we can write / = g; 
conversely, f — g implies that f(x) == g(x) almost everywhere. 

Theorem 4 3* If f(x), g(x) eL( 2 )(Q, fi) belong to the equivalence 
classes /, g e L 2 (Q y /x), and a is a complex number, denote by af and 
/ + g the equivalence classes containing the function af(x) and 
f(x) + g( x )> respectively. The operations 

(«,/) - af, (a,f) e C 1 X L\Q, /x), afeL*(Q, /x) 

(4 ' 4) (/,*)-/ + *, (J,g)eL\Q,^) X L 2 (Q, jj), f + geL^Q,^) 

are vector operations, and the set L 2 (Q, /x) with these vector operations 
becomes a Hilbert space when the inner product of /, g e L 2 (Q , /x) is 
defined by 

(4.5) </k> = f /*(x) g(x) dp.(x). 

The proof of this theorem, which is the central theorem of this 
section, will be given in a few stages. 
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First we have to note that (4.4) are indeed mappings, i.e., that af and 
/ -f- g are uniquely determined by f and g. That is quite obvious, because 
if f x = f 2 and g x = g 2 , then af^x) — af 2 (x) almost everywhere, i.e., 
a f 1 — a h \ an d since 

(4.6) {x: f^x) + g x (x) ^ U{x) + g 2 (x)} 

({ X: fl( X ) 7^ AW} ^ {^* gl( X ) ^ gifa)})* 

the measure of the set on the left-hand side of (4.6) is zero because the 
measures of both sets on the right-hand side of (4.6) are zero; therefore, 
fi( x ) + gi( x ) = f 2 ( x ) + &(*) almost everywhere, i.e.,^ + g ± = / 2 + g 2 • 
From Theorem 4.2 we know that af(x),f(x) J rg(x)eL( 2 )(Q,fi) 
whenever/(#), g(x) gL( 2 )(Q , /x), so that both mappings (4.4) are defined 
for all (*,/) e C 1 X L 2 (Q y p) and (/, g) eL 2 (Q, p) X L 2 (Q y p). Theo¬ 
rem 4.2 implies in an obvious way that we deal indeed with vector 
operations, if we adopt as the zero vector in L 2 (Q, /x) the equivalence 
class of square integrable functions that vanish almost everywhere on Q . 
From the inequality 

we can immediately derive 

(4.7) \f*(x)g(x)\ = |/(*)| | *(*)| < i(\f(x)\ 2 + | g(x)\ 2 ). 

Therefore, the function \f*(x)g( x )\ is integrable on Q if/(#) and g(x) 
are square integrable on Q , as a consequence of Theorems 3.3 and 3.9. 
Thus, f*(x)g(x) is integrable on Q (see Exercise 3.7), and (4.5) is 
defined for any /(#), g(x) eL( 2 )(Q, [jl). Furthermore, </ | £> is uniquely 
defined for any/, g eL 2 (Q, fi) because if/i(#) = f 2 ( x ) almost everywhere, 
then 

f (fi( x ) —M*))*g(x)M x )) = f (fi( x ) - M x ))* g( x ) M x ) = 0 , 

J n J R 

R — {xif^x) ^ / 2 (^)}, 

by Theorem 3.8. It is easy to check (see Exercise 4.2) that (4.5) satisfies 
the axioms for an inner product. 

Thus far we have proved that L 2 (Q, fx) is a Euclidean space. We turn 
now to proving the completeness of L 2 (Q , fx). 

Definition 4.2. A sequence of functions f^x), f 2 (x) y ... e L( 2 )(Q y fx) 
is said to be fundamental in the mean if for any e > 0 there is a positive 
integer AT(e) such that 


(4.8) 


f I /mW - f n (x)\* dn(x) < € 
J a 
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for all m, n > N{e). We say that such a sequence converges in the mean 
to f(x) € L( 2) (Q, /J.) if 

lim f \f(x) —f n (x)\ 2 d/^x) = 0, 

n-»+oo j q 

and in that case we write 


f(x) = l.i.m./ n (*), 


where “l.i.m.” stands for “limit in the mean.” 

According to (4.5) for any f(x) eL( 2 )(Q, fi) 

ll/ll 2 = </!/>= f \f{x)\ 2 d^{x). 

J Q 

Hence, iff 1 (x) 9 f 2 (x) 9 ... is fundamental in the mean, then ,/ 2 ,... eL 2 (Q, /x) 
is a Cauchy (or fundamental) sequence in the norm in L 2 (Q, /x), and con¬ 
vergence in the mean in L( 2 )(Q, /x) corresponds to convergence in the 
norm in L 2 (Q, /x). 

^Theorem 4*4 ( Riesz-Fischer theorem). If a sequence f 1 (x) 9 
f 2 (x),... eL( 2 )(Q, [jl) is fundamental in the mean, then it converges in 
the mean to a function/(#) eL( 2 )(Q, / x ). 

Proof. If f 1 {x) 9 f 2 {x) 9 ... is fundamental in the mean, by taking in (4.8) 

n k = l +N(IIS«), k — l, 2,..., 

we get a nondecreasing sequence n x ^ h 2 ^ ••• for which 

(4.9) f | f nk Jx) - f nk (x)\ 2 drfx) < l/8 fc . 

J Q 

Since fi(x), f 2 (x),... are /x-measurable functions, the sets 

(4.10) = {*: | f nk+1 (x) -f nk (x) | > 1/2*} 

are /x measurable. Furthermore, by (4.9) 

f I fn k+1 (x) - f nk ( X )\ 2 dll(x) < [ \fn k+1 (x) - f nk (x)\ 2 dfj.(x) < 1/8* 

J S k •> Q 


and since 


/ I/«.„(*) - fn k ( x ) 1 2 Mx) > 0/4*) KSu). 

•> s„ 
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we have 

(4.11) ^S k )<ll2K 
Therefore, the sets 

(4.12) R k = S k u S k+ 1 u k — 1, 2,... 

have measures 

(4.13) fji(R k ) < l/2 fc + 1/2** 1 + •- = 1/2*" 1 . 

A point x 1 e Q belongs to R k if and only if for some m ^ k 

I -fnji* l)l > 

Therefore, a point x e Q does not belong to R k if and only if 

fn m+1 (x) -f n Jx)\ < l/2 m , m = k,k -I- 1,... . 

Thus, if x e = Q — R k , then the series 

(4.14) /«,(*) + (/«,(*) -fnjix)) + (f n2 (x) -/„(*)) + - 

converges absolutely and uniformly on R k . Since this is true for every 
k = 1, 2,..., we conclude that 

(4.15) f(x) = \imf nm (x) 
everywhere on 

00 / QO W 00 

u/?»'= n ^ * = 

k =1 \fc=l / fc=l 

It is obvious from (4.12) that R X D R 2 D ••• . Consequently, due to 
the continuity from above of every measure (Theorem 2.1), we get by 
using (4.13) 

fji(R) = lim fi(R k ) < lim l/2 fc = 0, 

«->oo «->oo 

i.e., R is of measure zero. 

The function/(x) is defined everywhere on R' by (4.15), and being 
the limit of functions measurable on R' it is also measurable on R' by 
Theorem 3.4. We can extend the domain of definition of /(x) to Q by 
writing /(x) = 0 for x e R. The function so derived is still measurable 
because R is measurable. 
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Since f Ul ( x ), fn 2 ( x ),--- converges uniformly to f(x) on R k , the con¬ 
vergence 

(4.16) g nl (x) = I f n (x) —f ni (x) I 2 , g n2 (x) = I f n (x) ~ fn 2 { x )| 2 ,... 

-*£»(*) = I /»(*) -f(x )| 2 

is also uniform on R k ' for any fixed n — 1, 2,... . By assumption 
Snm( x ) == I f n ( x ) ~ fn m ( x ) I 2 are integrable on Q , and therefore also on 
/?*.', so that the set 

00 

Z* 0 = U (X'-gnUx) >0}-R k 

m=1 

is of or-finite measure, and can be therefore written as the disjoint union 
of sets Tffi, of finite measure. By Theorem 3.11 we can write 

(4.17) f g nm (x) djx(x) = f g nm (x) dfJ.( X ) = £ f gnm(x)M X )- 

J Rk - J r <n, , =1 j T rn 

Due to the uniform convergence on R k of the sequence (4.16), for each 
€ > 0 there is an N n (e) such that 

I gnm(x) — gn(x)\ < ™ > #»(«) 

for all x e R k . Thus, for every i = 1, 2,..., 

f , I gnm(x) — g n (x) \ d^(x) < <^( Tif), m > N n (e), 

J rp(n) 

1 ki 

and consequently 

(4-18) f g n (x)d/x(x) = lim [ g nm (x) d/x(x). 

J rji(7l) TYl-^Q 0 J rji(7t) 

ki ki 

Therefore, by using (4.17), (4.18), and Theorem 3.11 we get 
(4.19) lim f g nm (x)dix(x) = lim f g nm (x) d/x(x) 

co J £ / wi~-*co J rji{n) 


= i i Ttr Jndx)d^x) 

4 = 1 J ki 

°° /. 

= Z gn{x)dfX.{x) 

VI j T [n) 

1=1 J ki 

= f ,£*( x ) d i4 x ) = f gJx)M x )- 

J rp\n) J D ' 
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On the other hand, since the sequence /i( x), / 2 ( x),... is fundamental 
in the mean, for any e > 0 there is an N(e) such that when n, n m > N(e), 
we have 

f g«m( x )M X ) = f I /«(*) -fn m ( x )\ 2 M X ) < e - 

•> a J a 

Since g nm (x) > 0, we obtain that when n, n m > N(e) 

f gnm( x ) M X ) = f gnm( x ) d K X ) < e - 

jR k ’ J C 

Consequently, by using (4.19) we conclude that for n > N(e) 
f I /„(*) -/(*)I 2 M X ) = f gn(x) d K X ) = f gnm( x ) M X ) < « 

J R k ' J 

for every k = 1, 2,... . As we have 

fl - R = Q u (/?,' - R,!) u (22*' - «.') u 

fc=l 

we obtain by employing Theorem 3.8 (note that /x(i?) = 0) and Theo¬ 
rem 3.11 

f g n (x) M x ) = f <*/*(*) 

J Q J S2-R 

0° 

= f gn(x) Mx) + Y I , ,g«( X )M X ) 

J Rl k =1 J R k+l~ R k 

= lim f £»(*) 

i.e., for n > iV(e), 

(4.20) f I AM “ /Ml 2 < e - 

J £2 

Since we can write 


IAWI 2 l/(*)l* + I AW -/(*)!'. 

(4.20) implies that /(*) is square integrable and that /i(*),/ 2 (*),— 
converges to f(x) in the mean. Q.E.D. 

From Theorem 4.4 we can immediately conclude that L\Q, p.) is 
complete, because if eL*(0, j*) is a Cauchy sequence, then 
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f 1 (x) y f 2 (x) > ...eL( 2 )(Q > [jL) is a sequence which is fundamental in the 
mean; therefore, by the Riesz-Fisher theorem y f 1 (x) y f 2 (x) 9 ... converges 
in the mean to a function f(x) eL( 2 )(Q 9 /x), and consequently f x ,/ 2 
converges in the norm to feL 2 (Q, (jl). 

We note that in the course of proving Theorem 4.4 we have also 
obtained the following results. 

Theorem 4*5* If f 1 (x) y f 2 (x) y ... gL( 2 )(Q 9 [jl) is a sequence which 
converges in the mean to f(x) eL( 2 )(Q , /x) then there is a subsequence 
/ ni (#),/ na (#),... of the original sequence which converges almost every¬ 
where point by point tof(x), i.e., there is a set R of measure zero such 
that 

lim f n (x) — f(x\ xe R' — Q — R. 

m-+ oo m 

Furthermore, there is a monotonically decreasing sequence of sets 

00 

R^R.D-DR, R=f]R k , 

k=1 

which is such that f n (x)> f n fx) y ... converges uniformly to f(x) on each 
Rjc — & ** • 

*4.3. The Separability of L 2 Spaces 

Thus far we have proved that L 2 (Q, /x) is a Hilbert space, i.e., we have 
proved the key Theorem 4.3. There are, however, spaces L 2 (Q, /x) which 
are not separable (see Exercise 4.6). We would like to investigate under 
what conditions L 2 (Q , fi) is separable. In order to do that, we need the 
following theorem. 

Theorem 4*6* The vector subspace (C fi ) of L( 2 )(Q, /x), spanned* by 
the family — {xb{ x )> B g ^ n) > KB) < + °o} of all characteristic 
functions Xb( x )> x g °f Borel sets B C Q of finite measure, is every¬ 
where dense in the mean in L( 2 )(Q , /x), i.e., for a given f(x) eL( 2 )(Q , /x) 
and any € > 0 there is a function s(#) e C Q such that 

(4.21) f | f(x) — 4*01 2 dfju(x) < e. 

Proof . Due to Definition 3.5, it is obviously sufficient if we prove 


# Note that Cq is identical to the family of all complex simple functions which are 
tiileunihlc on Q. 
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the theorem for nonnegative functions f{x) ^ 0 from L^){Q y /x). Given 
such a function f(x)> the set 

R 0 = {#: f(x) > 0} 

has to be of or-finite measure because \f(x)\ 2 is integrable on Q , and 
therefore on R 0 . Thus, we can write R 0 as a disjoint union 

— (J**> < +°°j k = 1, 2,..., 

k=1 

of sets of finite measure. By Theorem 3.11 


f \nx)\ 2 d^x)= f i/(*)i»^) = f f i/(*)i*^). 

J a J r q k=1 J R k 


Consequently, given an € > 0, there is an integer N(e) ^ 0 such that 
(4.22) £ f |/(*)| 2 4*(*) <c/2. 

k=N(€)+1 J R k 


Since f(x) is square integrable on R k and /Ji(R k ) < +oo, by 
Theorem 4.1 f(x) is also integrable on R k . Therefore, according to 
Definition 3.4, there is a nondecreasing sequence s ( 2 k) (x),... of 

integrable simple functions vanishing outside R k such that 



for xe R k 
for x e R k . 


Consequently, | f(x) — |/(#) — * s a nonincreasing 

sequence on R k for which 


lim \f(x) - »«(*)!» = 0, xeR k . 


Therefore, by Theorem 3.10 

lim f | f{x) - 4' c) WI 2 dfx(x) = 0 

n ^°° J R k 

so that there is an n — n 0 for which 

\f(x) - s ( %(x)\* d,,(x) <<I2N(*). 

R k 


(4.23) 
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The function 


s(x) — 


I s£(x) 


is obviously an element of , for which we have, because of (4.22) 
and (4.23), 


f I/M ~ s{x)\ 2 d l x{x) 


N(c) 


I/M — sMI 2 dfx(x) 

Rh 


+ i f |/Ml 2 dfji(x) < e. Q.E.D. 

k=N(c)+l J Rk 

Consider now the family J™ of all closed intervals > 

(4.24) , ftj x - x [a n , 6J 

with rational end points, i.e., a x ,..., a n , ^,..., b n e 91. Denote by 
the subset of C^ consisting of all characteristic functions x*M> x G 
corresponding to all sets R = (7j U ••• U I m ) n £?, where I x ,..., J w e i/'™, 
i.e., 

(4.25) C£= {xj?M : * - (h u - u / m ) n fl, J,,..., I m eJ? n r ,m = l, 2,...}. 

We claim the following. 

*Lemma 4*2* If the measure /x( B ), 5 e > is 07 finite in Q y the 
family C r Q defined in (4.25) is everywhere dense in the set C Q C L( 2 )(Q, (jl ), 
i.e., for every B e and any € > 0, there can be found a x*M g Ca 
such that 

(4.26) f | xb(x) — XsWI 2 dfx(x) < €. 

J Q 

Proof. Note that | x^M ~ Xj?(^)I can assume only the values 0 and 1 
and consequently 

(4-27) f I xb(x) - x*(*)l 2 d lA x ) = f I Xb( x ) - X*(*)l M x ) 

J Q J Q 

= n(B -R)+ p(R - B) = /*(£ A R). 

Consider now the Boolean or algebra ^/(^) generated by the family 
.'#{! of all finite unions of intervals with rational ends. Since any interval 
1 1 J n can be written as a countable union of disjoint intervals with 
rational end points, we have ja^(^) D J n . Consequently 





112 


II. Measure Theory and Hilbert Spaces of Functions 


because D 39* . Thus, we also have 

s^Cq) ={Br\Q: Be stjffl)} = {B n Q: B e = 3g* , 

i.e., 38^ is identical to the Boolean or algebra generated by C r Q . 

We can easily establish (see Exercise 4.7) that C r Q is a Boolean algebra. 
According to Theorem 2.3, the set function 

m = inf j £ ,j.{R k ), 0 R k -DB,R ly R 2 ,... e cd, Be 88% , 

(fc=l fc=1 ) 

is a measure on the Boolean a algebra generated by the Boolean 
algebra C£ . Furthermore, if /x is a cr-finite measure, then \l(B ), B e C£ , 
has a unique extension to s0 that /x and /I are identical: 

/1(B) =/*(£), 

From the definition of /x(J5) it follows that for any € > 0 we can choose 
R x , >••• G so that 

m < Imb*) < mb) + «/2. 

fc=i 

If we write 

W 00 

= U = U ^ ’ 

k=l k=1 

we have S 1 C S 2 C , and since B C S, 

lim u,(B A *S' w ) = lim u(B — < S , W ) + lim /x(*S w — B) = fji(S — B) 

n-»oo n->oo n-400 

= M* 5 ) — MB) < £ MB*) — MB) < e/2. 

fc=l 

Consequently, there is an n 0 for which 

MB A S„ 0 ) < e/2, 

and therefore, for R = S n we have 

ti(B A R) < e. 

The above result in conjunction with (4.27) proves that x*(*) satisfies 
(4.26). Q.E.D. 
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*Lcmma 4 3* If /x is a cr-finite measure, the family £) s of all simple 
/x-integrable functions 

m 

s{x) = £ a kX R k (x), RiR m sCl, m = 1, 2,..., 

1 

with e C 1 having rational numbers for their real and imaginary 

parts (i.e. Re , Im Re a m , Im a m e 91), is a countable subset of 

L( 2 )(Q, fji) which is everywhere dense in the mean in L( 2 )(Q , /x). 

Proof . The result that is countable can be deduced easily (see 
Exercise 4.4) from the statement that the set 91 of all rational numbers 
is countable. 

If f(x) eL( 2 )(Q, fji), according to Theorem 4.6 we can find an integrable 
simple function s(x) e (C Q ) 

m 

s ( x ) = X c kXBp), Bi = 0 for i ^ j, 

1 

&,(*).»•, XiJ*) 6 Co 

such that 

(4.28) ||/ - s || 2 = f |/(*) - <*)| 2 <W*) < c/9. 

We can now choose complex numbers a x ,..., a m with real and imaginary 
parts which are rational numbers so that 

I tf/c ~ c k I 2 < e/9m/x(jS fc ), | a k | > 0. 

The simple function 

m 

h( X ) = X a kXB k {x) 

1 

is integrable because $(#) is integrable, and 

r m 

(4.29) || f — *! || a = J | s(x) - sj(x)| 2 dix{x) = £ | - c k | a < e/9. 

•'« fc-i 

Finally, according to Lemma 4.2, for each XB k ( x ) we can find a 
Mich that 

W" W ) II Xfl* - XX k IP = J fl I XB h (x) - x*»l a M*) < 9^2j^ -j 2 . 

k = 1 ,..., m. 
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We see that 

m 

*r(x) = Z a kXR,( X ) 

1 

belongs to £) s , and since || h ||, h(x) eL( 2 )(Q, /z), is a norm in L 2 (Q, /z), 
by applying the triangle inequality and using (4.30), we get 


II h “ *rll = 


m 

E a k(XB k ~ XR k ) 


m _ 

< E I ! II XB k — XR k II < V7l3. 


1 fc=l 1 A:—1 

Thus, by combining this relation with (4.28) and (4.29), we arrive at 

II/ —MKII/-*!l + IU-*ill + ll*i-M! < V7, 

i.e., we have an s r (x) e D s for which 


11/ — ^ II 2 = f l/W ~ *r(*)l a dp(x) < e. Q.E.D. 

Lemma 4.3 establishes the separability of L 2 (Q, (jl) when fi is or finite 
because the subset & s of all equivalence classes s e L 2 (Q , p) containing 
a function s(x) e is obviously a countable everywhere dense set in 
L 2 (Q , jit). Thus, we have proved the following theorem. 

Theorem 4*7* The Hilbert space L 2 (Q, (jl) is separable when the 
measure fz(B), B e , is or finite. 

The following statement can be immediately deduced from Lemma 4.3 
by noting that every finite union R k of finite intervals can be written as 
a finite union of disjoint finite intervals. 

Theorem 4*8* If the set Q is of or-finite ^ measure, the vector 
subspace spanned by all the characteristic functions Xr( x )> x e Q, 
corresponding to the sets R — I n Q, where I e J> n is a finite interval, 
is everywhere dense in L 2 (Q, (jl ). 

A special class of Hilbert spaces L 2 (Q, (jl) which are of great importance 
in quantum mechanics corresponds to the choice (jl(B ) = ^^(B), 
B e , where /z ( f n) is the Lebesgue measure on . The Hilbert space 
L 2 (Q, fjb\ n) ) is separable because is a a-finite measure. Such a Hilbert 
space L 2 (Q , ^\ n) ) is simply denoted by L 2 (Q ), and correspondingly 
L( 2 )(Q, fjb\ n) ) is denoted by L( 2 )(Q). 

In the case Q = 1R 71 , the setL( 2 )(lR rl ) is the space of all Borel measurable 
functions f(x), for which the Lebesgue integral 



f \m\ 2 d^\x) 

J (F8 n 
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exists. The subset L^(U n ) of all geL^(U n ) corresponding to functions 
g(x) whose Riemann integral J** ••• J** \ g{x)\ 2 d n x exists and is finite 
is obviously a linear subspace of L^(U n ). Due to Theorem 3.7 the 
Euclidean space L#(U n ) is isomorphic to the space ^ ( 0 2) (lR rl ) of all con¬ 
tinuous functions which are square integrable on R n . We know that 
^ 2 )(^ 1 ) ^ incomplete (see Chapter I, §4), and the same can be proved 
about 4 7 ® 2 )(lR rl ). Thus, the space L^(R n ) is incomplete, which shows that 
the introduction of the Lebesgue integral is essential for the validity of 
the Riesz-Fischer theorem, and thus for the construction of complete 
Euclidean spaces of square integrable functions. 

4.4. Change of Variables of Integration 

In order to prove the next theorem we need the following lemma. 

*Lcmma 4*4* If the functions ^> 1 (^),..., <f> n (x ), x = (, x 1 ,..., x n ) e B 0 , 
mapping the Borel set B 0 e & n into R n , are Borel measurable, then the 
mapping 

X > X 1 = (j. >(x) = <^ w (x)) 

is a Borel measurable mapping , i.e., the inverse image <f>~\B) of any Borel 
set B e & n is again a Borel set. 

Proof. For an n-dimensional interval I — I 1 X • • • X I n , I x ,..., I n e ^, 
we can write 


4>-\i) = n fc'ih). 

k—1 

Since <f> k (x) is a Borel measurable function, the set is a Borel set; 

hence, </> _1 (7) is a Borel set for any interval I e J> n . 

Denote by the family of all sets S for which ^>~ 1 (S') e & n . is a 
Boolean or algebra: if S 1 , S 2 ,... e , then 

u S 2 u •••) = ^(^j) u ^-\S 2 ) u - e @ n . 


i.e., S { US 2 U and for any S e 

i.e., S' . Since we have already established that D and since 
:hf n is the smallest Boolean a algebra containing J n , it follows that 
it* C , i.e., 0 \B) e M" for any B e St*. Q.E.D. 
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Theorem 4*9* (a) Let Q be a Borel subset of [R n whose boundary* 

is of Lebesgue measure zero and on which a Lebesgue-Stieltjes measure 


fi(B) = I p(x) d n x , p(x) > 0, Be , 

J B 

is defined, and let x -> x 1 — <f>(x) be a mapping of Q onto the set Q L C [R n , 
given by 

Xi = <f>\(Xi # n ), 

(4.31) x n — <j> n {x\ # n ), 

X = (#1 # n ) £ r = (#j ,•••, #72,) G 


where ^> 1 (#),..., </> n (#) are almost everywhere in Q continuously differen¬ 
tiable. If the Jacobian of the transformation 




2k.. 

..ah 

Wl... 

•><£«) _ 

dx x 

8x n 

8(x x 

•> x n) 

Hn .. 




dx 1 

8x n 


1 

/V) 


is almost everywhere positive, then 

(4.32) ^(B 1 ) ^ f d^x) = f J\xi)p(</>-\xi))d"x\ B'e<%l i, 

J J B 1 

determines a Lebesgue-Stieltjes measure on . 

(b) If the mapping — </>(#) has almost everywhere in Q an 

inverse x 1 x = ^~ 1 (# 1 ), then for each function/(#), # g £?, the function 

(4.33) fi(x 1 )=f(<t>~i(x 1 )) 

is defined almost everywhere on fl 1 , and is /x 1 integrable on B 1 if f(x) is 
pi integrable in B — ^>~ 1 (B 1 ); we have 

(4.34) f /X# 1 ) ^/x 1 ^ 1 ) = f /(#) dpL(x), B 1 g && . 

^ b-itHb 1 ) 

* Recall that a boundary point of a subset of a metric space is any point which is not an 
interior point but it is an accumulation point of that subset; an interior point of a set O 
is a point which has an e neighborhood which lies in Q, 
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The mapping 

(4.35) /-A, PeL\Q\t,\ 

is a unitary transformation (see Definition 2.4 in Chapter I) of the 
Hilbert space L 2 (Q y p) onto the Hilbert space L 2 (Q 1 y p 1 ). 

Proof, (a) Denote by Q 0 the set of all interior points x e Q at which 
</>!(#),..., (f> n (x) in (4.31) are continuously differentiable and at which 
J(x) > 0. If # g Q 0 , then f^x),..., <f> n (x) are continuously differentiable 
and J(x) > 0 also in some neighborhood of thus, the set Q 0 is open. 
According to the assumptions, p(Q — Q 0 ) = 0. 

The set Q 0 is mapped by (4.31) onto a set £?£ C Q 1 . Since is 

continuous in Q 0 , it is also bounded and therefore integrable on every 
compact set B C Q 0 of finite measure. According to Lemma 4.4 the 
inverse image B y under the mapping (4.31), of every Borel set B 1 C £?£ 
is also a Borel set, due to the fact that <f> i(#),..., <j> n (x) are continuous and 
therefore measurable by Theorem 3.1. 

Thus, (4.32) really determines a measure on Q\ , which can be extended 
to Q 1 if we write 


- Q\) = 0 . 

(b) It is easy to see that the function/ 1 ^ 1 ) — /(^~ 1 (^ 1 )) is p 1 measur¬ 
able on* Q 1 if f(x) is p measurable on Q. In order to establish (4.34), 
note that for 

A* 1 ) = B'z@l i, 

we get from (4.32) that (4.34) is equivalent to the relation 

(4.36) /iW = rt<t>~W)). 

If we choose B 1 to be an open subset of fl 1 , then the set B is also open 
if B denotes the inverse image f^B 1 ) of B 1 after a possible subtraction 
from </> _1 ( J B 1 ) of the set f^B 1 ) n (Q — Q 0 ) of measure zero. When B 1 
is an interval in Q y we can deduce from well-known theorems of the 
calculus that (4.36) is true by noting (first for continuous p y then 
generalizing) that (4.36) can be written in the form 

( 4 . 37 ) f K ^" 1 ^ 1 ))/ 1 ^ 1 )^ 1 = f p ( x ) d n x . 

J B 1 J B 

# f l (x l ) is defined almost everywhere by the relation f 1 (x l ) — /(^(jc 1 )). On a remaining 
net of fi l measure zero we can define /‘(a? 1 ) arbitrarily. 
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Thus, due to the additivity of ^(B 1 ) and /x(i?), (4.36) is also established 
for any set B 1 which is a finite union of disjoint intervals lying in Q x y i.e., 
on the Boolean algebra 38q Q generated by the family J>% of all intervals 
lying in Q (see Exercise 4.8). Since ^(B 1 ) is a a-finite measure and the 
Boolean a algebra generated by * s ( see Exercise 4.8), we 

conclude from Theorem 2.3 that (4.36) is true for any B 1 e 38%i . Thus, 
as the next obvious result, we have that (4.34) is true when / 1 (^ 1 )—and 
consequently also f(x)— are simple functions. From Definitions 3.4 
and 3.5 we can infer in a straightforward manner that (4.34) is true 
whenever f(x) is /x integrable on B. 

If fx{x) = f 2 (x) almost everywhere in the /x measure, then 
/K* 1 ) -fK* 1 ) almost everywhere in the /x 1 measure, where f}(x 1 ) y 
i — 1,2, are defined almost everywhere by (4.33). We conclude that 
(4.35) defines a mapping of L\Q y /x) onto L 2 ^ 1 , /x 1 ). It is quite evident 
that this mapping is linear, has an inverse and preserves the inner 
products of the spaces L\Q y /x) and L\Q 1 y /x 1 ), i.e., that it is a unitary 
transformation. Q.E.D. 

Exercises 

4*1 ♦ Show that the relation, “ f(x ) — g(x) almost everywhere,” is an 
equivalence relation in L( 2 )(Q y /x). 

4*2* Check that 

</1 g> = jf *(*) g(x) dp(x), f,gs L 2 (Q, ix), 

satisfies the axioms for an inner product. 

4 3* How can you deduce directly from Theorem 4.3 that / 2 (-f-°o) 
is a separable Hilbert space ? 

4*4* Show that the set C r Q appearing in Lemma 4.2, and the set O s 
introduced in Lemma 4.3 are both countable sets. 

4*5* Show that the set , /x(£?) < +oo, in which we identify any 
two sets whose symmetric difference if of measure zero, is a metric space 
if the distance of R y S e 38% is taken to be d(R y S) = fx(R A S)> 

4*6* Check that the set function /x(-B), Be#, defined as being 
equal to the number of elements in B y in case B is finite, and +oo in 
case that B is an infinite set, is a measure on . Show that L\ IR 1 , /x) 
is not separable. 

4*7* Prove that the set C% introduced in Lemma 4.2 is a Boolean 
algebra of subsets of Q, 
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4*8* Show that the family of all unions of disjoint intervals 

I e lying within an open set Q, Q C (R n , is a Boolean algebra. Prove 
that the Boolean a algebra generated by is identical to . 

5. The Hilbert Space of Systems of n Different Particles 
in Wave Mechanics 

5.1. The Schroedinger Equation of n-P article Systems 

We shall study now the general framework used in wave mechanics 
to describe a system of n particles in which each one of the particles is 
of a different kind from the rest. The requirement that the particles are 
of different kinds is essential because systems containing two or more 
particles of the same kind of atomic or subatomic size have peculiarities 
of behavior reflected in certain statistical features, which require that 
they conform to either Bose-Einstein or Fermi-Dirac statistics—subjects 
that we will discuss in Chapter IV. For instance, a two-particle system 
of the kind we are considering now would be an electron-proton system 
which, when in a bound state, constitutes a hydrogen atom. An example 
of a three-body system of different particles is an electron-proton- 
neutron system, which is called a deuterium atom when it is in a bound 
state. However, tritium, consisting of one electron, one proton, and two 
neutrons, already is not a system of the considered type, except if we 
treat its nucleus as one particle and the electron as another particle. 

We must mention that at present we ignore the possible internal 
degrees of freedom that the particles might have. These internal degrees 
of freedom are reflected in the existence of spin, a concept we shall 
introduce in Chapter IV. 

We assume that our n particles move in three dimensions, and denote 
by r k the position vector of the kth. particle. If we choose in the three- 
dimensional real Euclidean space R 3 a fixed inertial reference system of 
coordinates* characterized by three orthonormal vectors e x , e y , and e z , 
we can expand r k in terms of e x , e y , and e z , 

= x k e x + y k e y + z k e z , k = 1, 2 ,..., n. 

In wave mechanics it is postulated that the state of a system of n par¬ 
ticles is described at any given time ^ by a function i/r(r! ,..., r n ; t) 


# In practice the reference frame is usually tied to the laboratory in which the experi¬ 
ments are performed. For the duration of the typical microscopic experiment such a frame 
moves uniformly with respect to the sun, i.e., it is approximately inertial. Such a system is 
called the laboratory frame of reference . 
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defined on the configuration space U 2n of the coordinate vectors r n . 

It is required in addition that r n ; t) is once continuously 

differentiable in t and that for fixed t it is square integrable,* i.e., 
i/f(r!r n ; t) gL( 2 >((R 3n ), and normalized 

(5.1) f | ^(rj r n ; t) | 2 ••• dr n = 1. 

In general, if a function ^{r 1 r n ; t) can represent a state of the 
given system, then r n ; t) is called a wave function of that 

system. 

As in the one-particle, one-dimensional case treated in Chapter I, §5, 
the wave function ifj(r 1 r n ; t) does not have any direct physical 
meaning. Instead, we have again (as a special case of the Born corre¬ 
spondence rule, which will be given in Chapter IV) that the Lebesgue- 
Stieltjes measure 

(5.2) P t (B) = f | </>(!*! ,..., r w ; t) | 2 dr x ••• dr n , Be J ,3n , 

J B 

is interpreted as a probability measure: P t (B) gives, for every Borel 
set B y the probability of having within B the outcome of a measurement 
at time t of the positions r x ,..., r n of the n particles of a system prepared 
in a state represented by ifj(r 1 ,..., r n ; t). 

In view of Barn’s interpretation, it turns out, as we shall prove in 
general in Chapter IV, that if a state is described at time t by the wave 
function i/j(r 1 ,..., r n ; t ), then the function ci/j 1 (t 1 ,..., r n ; t ), where 
| c | = 1 and ifj(r 1 ,..., r n ; t) = r n \t) almost everywhere (with 

respect to the Lebesgue measure on R 3n ), describes the same state. 
This implies that each function in the equivalence class i fj(t) eL 2 ([R 3n ), 
containing ,..., r n ; t) describes the same state as ,..., r n ; t). 
Thus, we can state as the first assumption + of wave mechanics: 

Postulate Wl* The state of a system of n different particles is 
described at any time t by a normalized vector i p(t) from the Hilbert 
space L 2 ([R 3n ). The time-dependent vector function cifj(t), | c \ = 1, 
represents the same state as i p(t). 


* Throughout this section integrability and measurability are only with respect to the 
measure space ([R 3n , ^’ 3 ” > /w-{ 3n) ). 

+ The postulates we shall state in this section represent a convenient way of system¬ 
atically formulating some of the basic assumptions of wave mechanics. These postulates 
are by no means exhaustive. A complete study of the postulates of quantum mechanics 
will be given in Chapter IV. The reader should find it easy to adapt those general 
axioms to the special wave mechanical case. 
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As an equation of motion we have now a Schroedinger equation of a 
more general form than (5.4) of Chapter I. If the considered system is 
such that in the classical mechanics case the potential energy of the 
system* is V(r 1 r n ), then we arrive at the Schroedinger equation by 
a generalization of the heuristic procedure given in Chapter I, §5. 

Take the energy conservation equation 


(5.3) 


n 


&=1 


Pfc 2 

2m k 


+ 


V, 


where p k and m k are, respectively, the momentum and the mass of the 
kth particle, and make in (5.3) the following formal substitution: 


E ^ ih Tt' 


Pk 


& V* , 

8 


8 , 8 
o “T" o ~r~ ^z o 

dx k v dy k dz k 


We arrive at the formal operator relation 


vh — 
H 8t 


n £ 2 

E s-4. + r. 


(5.4) 


__ d 2 d 2 d 2 

2 ' q.. 2 “T 


fak ' fyk 2 ' d*k ' 


Now we postulate the following. 

Postulate W2* Each wave function ,..., r n ; t), eL 2 (U 3n ), 

|| || = 1, is once differentiable in t. If ,..., r n ; t) is twice differen¬ 

tiable in the configuration space U 3n of the variables r 1 ,..., r n , then it 
satisfies the time-dependent Schroedinger equation, 


(5.5) 



0(r x v, r n \ t) 


= (- E Ak + >•••’ r » ; *)> 

whenever the function on the right-hand side of (5.5) belongs toL( 2 >(lR 3n ). 
In view of the interpretation of P t (B) in (5.2), it is essential that a 

* We are assuming no external force, so that the potential energy V(r x .r„) is 

time independent. Classically, V(ri .r n ) completely describes the interaction. The 

force F* with which the rest of the particles are acting on the &th particle is F k — 
MdV/dx k ) I e v (dV/dy k ) | e,(0r//0* fc )]. 
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wave function satisfy (5.1) at all times. We leave to the reader (see 
Exercise 5.2) to prove the following theorem. 

Theorem 5*1* If the function i/j(r 1 ,..., r n ; t), i p(t) eL 2 ([R 3rl ), is once 
continuously differentiable in t and twice continuously differentiable in 
the configuration space variables, and such that (^l^t)i/j(r 1 ,..., r n ; t) 
and ZLi AfclAfo r n ; t) are square integrable, and if </r(r x r n ; t) 
satisfies the Schroedinger equation (5.5), then 

II <AWII 2 = f , I 0(*iOl 2 dr x ••• dr n = 1 

provided || i/r(0)|| = 1 and | V fc if> | =0(| r k |““<=), <x k > 2, as | r k | oo. 

Using the same procedure as in §5 of Chapter I, we shall try to find 
solutions of (5.5) of the form 

(5.6) <A(r x r M ) exp [—{ijh)Et] 

where i/j eL 2 ((R 3n ). By inserting (5.6) in (5.5) we arrive at the time- 
independent Schroedinger equation for our problem, which can be written 
in symbolic form: 

(5.7) H s 0(r 1 ,...,r w ) = ^(r 1 ,...,r w ), 
where H s is a differential operator form* 

n 2 j 2 

(5.8) H s = - X A* + K(r x ,..., r w ) 
called a Schroedinger operator. 

Thus we have arrived again at an eigenvalue problem (5.7). The same 
terminology applies now as in the one-dimensional, single-particle case 
treated in Chapter I. 

5.2. The Center-of-Mass Frame of Reference 

There is, however, an essential difference (besides dimension and 
number of particles) between the present case and the one already 


* It must be realized that a differential operator form such as (5.8) is essentially only 
a recipe for mapping a certain class 2 of functions into another class JT of functions 
by applying to each function from 2 the procedure symbolically embodied in the operator 
form. Once the class of functions 2 is specified, we deal with a well-defined mapping, 
which is customarily called an operator. However, the same differential operator form can 
be applied to different classes of functions, thus defining different operators. The 
importance, from the mathematical as well as physical point of view, of specifying in a 
given problem the class of functions on which the differential operator form acts will be 
explained in Chapter III, §2, and (5.8) will be studied in Chapter IV, §7. 
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treated in Chapter I: the one particle was moving inside an external 
field of force described by the potential V(x), while at present the 
interaction originates inside the system by having the particles interact 
with one another with forces described by the potential F(r x r n ), 
and we assume no external forces. Thus, classically speaking, the total 
force on the center of mass 


(5.9) 


R 


"Vi + **• + m n r n 

M 


M = m 1 + + m n , 


of the system will be zero (see Exercise 5.1), and the center of mass will 
be in uniform motion in any inertial reference frame. Quantum mechani¬ 
cally, this means that (5.7) does not have any solutions for which 
i/j e L 2 (U Zn ). To see this, let us replace the variables r 1 ,..., r n with new 
variables r/,..., r' n _ l9 r n ' — R, which include the center-of-mass position 
vector R. To be specific, let us choose the following variables: 


r i = r 2 — r 1 , 

r 2 =r 3 -R l5 

(5.10) r;_x = r n — R w _ 2 , 

R = + ■" + m n r n), 

R _ m fc+i r fc+i + •' • + m k) Rfc-i 

k *% + *■■+ m k+l 


k — 1, 2,..., n — 2. 


In order to establish how the Schroedinger operator form looks in these 
new variables, we shall proceed in a few stages. First note that if we 
make in ,..., r n ) the substitution 


(5.11) 


ri = r 2 — r x , 

R = rn^T x + m 2 r 2 


m 1 + m 2 

thus obtaining the function iA(i)(r 1 / , R x , r 3 ,..., r n ), we have 

< 5 - l2 > + . r "> 

“ (-ST 4 ‘' + 2jS7T«5 A "') Mt '' R ' ’ r *. rJ ’ 
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where M 1 is the reduced mass of the first two particles: 

m 1 + m 2 

02 0 2 02 

(5.13) Aj Qx' 2 ^ By ' 2 8s' 2 ’ r i ~ x i Ji t«i 

0 2 02 02 

^Rl 2 gy 2 ~ ~8Z~ 2 9 ^ l e i/ ”1” • 

We shall proceed by induction in k. By performing in the function 
4>(k- r&_i, R fc -i, r, c+1 r n ) the substitution 

*k = ffc+i - Rfc-i, 

(5.14) = m k+1 r k+1 + (w x + ••• + m k ) R fc _ x 

k m 1 + ---+m k+1 

we get the function tfi ik) ( r/,..., r fc ', R*. , r, c+2 ,..., r m ), and we have 

(5 ' 15) ( 2 ^ Afc+1 + 2K + - + m k ) Ar ^) 

X *A(fc—l)(**l j***j ^k— 1 j 1 j **7c+1j‘“j *%i) 

= ('2MT + 2K + - + m fc+1 ) Ar *) 

X 0( fc )(ri j«««j , R fc , ^*fc+2 j***j 

where, for ft — 1 , 2 ,..., n — 1 , 

M = (”»! + — + ^fcW+l 

fc iff! + — +m fc+1 

a 2 a 2 a 2 

(5.16) A fc = fix'* ^y'2~ 3z'^ 9 *k ~ X k^x yu z k ^z > 

a 2 a 2 a 2 

^Rjj. ^2 fly 2 fly 2 ’ ^k ^k£x ^k^y “I - ^k^z • 

Thus, by induction in ft we easily get from (5.12) and (5.15) 



X 0(n—l)(l*l »•••» ** n _ 1 > ^)> 


(5.17) 
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since from (5.11) and (5.14) it follows that 

n _ m n r n + «„_!r„_ x + ••• + D 

- ---1- - tv. 

m l + “* + m n 

Now, in physics the only potentials used are those which, from the 
classical point of view, correspond to forces which satisfy the action- 
reaction principle. This implies (see Exercise 5.3) that when F(r x ,..., r n ) 
is expressed in terms of the variables (5.10), it does not depend on R. 
Thus, in the new variables the Schroedinger operator form is 

i/ 

h 2 n ~ 1 h 2 

(5.18) Hs =- m a r - £ A/ + 

If we seek now a solution of (5.7) of the form 

(5.19) 0(r x ,..., r n ) = 0 C (R) W,..., <-i) 

we arrive at the following system of differential equations: 


(5.20) 


— E c i/j c (R ), 

(5.21) 

r n -1 *2 

[ ^ 2 M } Ai + V( ~ ri 

<-i)] nr 


= E b W( r/,..., rU), 



E = E c - 1 - E \). 



Following the classical tradition, E c is called the energy of the center of 
mass of the system, E h is the internal energy of the system, and E is the 
total energy. 

The transformations (5.10) obviously map [R 3 ™ into itself. Because of 
the linearity in r 1 ,..., r n of those transformations, the Jacobian 

T = g(f! r n ) 

J Sir,',..., r;_ x , R) 

of the transformation is obviously a constant in our case. Thus, we can 
write (Theorem 4.9) 

f . I'/'(«'! r»)l 2 — dv n 

J R 3n 

= / f ,1 H r l .r»-i. R)l a *i' ••• dr ’n -1 dR > 



126 


II♦ Measure Theory and Hilbert Spaces of Functions 


where i/^r/,..., R) is the function which we obtain when we express in 
0(r x r n ) all r 1 r n in terms of r/,..., R. If i/q is of the form (5.19), 
then we have by Fubini’s theorem (Theorem 3.13) 

f | 0(r lv ..,r n )| 2 

J R 3n 

= / f | MW dR f W( ri ' .r;^) dr,’ - drU , 

J R 3 J R 3(n_1) 

i.e., i/jeL 2 (U Sn ) if and only if i/r c eL 2 (IR 3 ) and WeL 2 ( rsi^-D). However, 
the equation (5.20) for the energy of the center of mass of the system 
does not have any eigenfunctions i p G eL 2 ([R 3 ). Its eigenfunctions 

“2M ARexp (y pcR ) = 2^ exp (y pcR )’ PceR3 ’ 

are not square integrable, corresponding to the fact that, as we shall see 
in Chapter IV, §7, the operator A R does not have a point spectrum, but 
only a continuous one. Physically, this is due to the fact that, because of 
the lack of external forces, the center of mass of the system is in free 
motion. 

5.3. The Bound States of n-P article Systems 

Postulate W3* The set S p of all eigenvalues of the equation (5.21) 
which correspond to square-integrable eigenfunctions are the only 
internal energy values which the n-particle system in a bound state can 
assume. The closed linear subspace ^f ( b n) of L 2 (lR 3n ), spanned by all 
i fi eL 2 ((R 3n ) for which 

W,, C,) = 0(r x ,..., r n ), e L«(R»<-i>), 

is an eigenfunction of (5.21) for every R e 1R 3 , contains all the Hilbert 
vectors which can represent, at a given time t, a bound state of the 
n-particle system* interacting via the potential 

V(ri ,..., r n ) = r;_ x ). 

The interpretation of an eigenvalue E h of (5.21), with eigenfunction 
W eL 2 (lR 3(n_1) ), as internal bound energy is possible only if E h is a real 
number. This will be so if we can generalize Theorem 5.1 in Chapter I. 


* We talk about bound states of the entire w-particle system, which leaves out, e.g., 
when w ~ 3, the case when two particles are bound, while the third one is free! We shall 
study this more general problem in scattering theory. 
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Theorem 5*2* The Schroedinger operator form H s , the center-of- 
mass energy operator form 


(5.22) h,= ~ — A r 

and the internal energy operator form 

(5.23) Hj = - ? ^ A/ + Vft,..., rU) 

define Hermitian operators when applied to twice continuously differen¬ 
tiable functions, which together with their first derivatives vanish 
faster than | r k | -1 , k — 1 ,..., n, as | r fc | —► oo. 

A differential operator A defined on L 2 (R m ) is said to be Hermitian 
if, whenever Af and Ag are defined for /, £eL 2 ((R m ) and belong to 
L 2 ([R m ), then 

(5.24) <Af\g> = <f>\Ag}. 

Proof. Denote by S r the sphere in IR 3 of radius r and with center at 
the origin, and by B r its surface. By applying Green’s theorem twice 
on S r n = S r X • • • X S r e (R 3n , we get 


(5.25) 


f (#s fYgdr.-drn 

J s r n 

+ f f*{H s g)dt 1 — dr n , 

Jan 


where d/dn k is the derivative in the coordinates r k in the direction of 
the outer normal to B r . When we let r -> + 00 in (5.25), we get 


f , ( H sf)*g d *i d *n = f f*(Hsg) d ^i dr n 

J r 3 « J r 3 « 

because the surface integrals entering in the sum on the right-hand side 
of (5.25) have to vanish, since the functions |/|, \ g \> \ dfldn k |, and 
I I ^ n k I are assumed to vanish faster than | r k | _1 when I v k | —> +oo. 
The proof for H c and is completely analogous. Q.E.D. 

If W 1 , ¥* 2 eL 2 (IR 3(n ~ :I) ) arc eigenfunctions of (5.23) corresponding to 
the eigenvalues E [and E\f\ respectively, then we also have, by (5.23), 
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that H^P ly gL 2 (IR 3(w_1) ). Consequently, by Theorem 5.2 (in 

Chapter IV we shall see that the condition at infinity can be dropped): 

(5.26) ePxv, I w 2 y = I w 2 y = <5* I HiF 2 y = 4 2) <^11 ^>- 

In case that W 1 = W 2y and therefore £| ) 1) = E^ ] — E h , the above 
relation implies that E h = E b *. If £'{ ) 1) E^ ] we get from (5.26) that 

| ¥^ 2 ) = 0. Thus, we have arrived at the following conclusion. 

Theorem 53* Each eigenvalue E of the internal energy operator 
H { in (5.23), belonging to an eigenfunction !P GL 2 (R 8(n_1) ), is a real 
number. Eigenfunctions corresponding to different eigenvalues are 
mutually orthogonal. 

Thus, to each eigenvalue belongs at least one nonzero eigenvector, 
and the eigenvectors belonging to different eigenvalues are orthogonal 
and therefore linearly independent. Since L 2 (lR 3(n_1) ) is separable, any 
orthogonal system of vectors contains at most a countable number of 
elements. Hence, we can state Theorem 5.4. 

Theorem 5*4* The number of bound-state energy eigenvalues is 
at most countably infinite, i.e., the point energy spectrum S p contains 
a countable number of elements. 

5.4. Properties of the n-P article Schroedinger Operator 

An immediately noticeable feature of the Schroedinger operator 
form in (5.8) is that it is not defined on all the elements of L 2 ((R 3n ). The 
class of functions from L 2 ((R 3n ) on which H s is defined depends on the 
particular form of the potential V(r 1 ,..., r n ). Broadly speaking, the 
more singularities V(r 1 ,..., r n ) has the narrower the class on which H s 
can be defined. In the next chapter we shall find Theorem 5.5 very 
useful. 

Theorem 5*5* If V(r 1 ,..., r n ) is square integrable on every bounded 
measurable set in (R 3n , then the differential form H s when applied on 
the functions of the family ^(lR 3rl ) of infinitely many-times differentiable 
functions with compact support* in [R 3n , maps linearly the set ^([R 3n ) 
(where obviously ^([R 3n ) C L( 2 )((R 3n )) into L( 2 )((R 3n ). 

Proof. If /e^£(R 3 ”), then / can be represented by a function 
f(r 1 ,..., r n ) which is certainly continuous and thus measurable, and 
vanishes outside a bounded open set S, i.e., it is integrable and square 

* The support supp / of a function / is the closure of the set of all points at which the 
function is not zero. In IR W a set is compact if and only if it is closed and bounded. 
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integrable on (R 3n . By the hypothesis of the theorem, V(r 1 r n ) is 
square integrable on S. Thus, V(r 1 r n )f(r 1 r n ) is integrable and 
therefore also square integrable, as we can convince ourselves by using 
an inequality like (4.7) and applying Theorem 3.9. Furthermore, we 
evidently have that for every k — 1,..., n 

A fc /eqr)CL (2) (r), 

and therefore we can conclude that H s feL 2 (U 3n ). Q.E.D. 

The importance of the preceding theorem is related to Theorem 5.6. 

Theorem 5*6* The set of all elements of L 2 (R m ) which can 

be represented by infinitely many times differentiable functions of 
compact support is a vector subspace ofL 2 ([R m ), which is dense inL 2 ([R m ). 

Proof. The set ^((R m ) is obviously linear. That ^°((R m ) is dense in 
L 2 ((R m ) can be demonstrated in the following way: 

According to Theorem 4.8, for any given / e L 2 (lR m ) we can find a 
simple function 

V 

s ( x ) = Z c kXh (x) 

(where each I k is a finite interval) such that 

11/-'ll <«/2- 

If we could prove that the characteristic function of any finite interval 
can be approximated in the mean arbitrarily well by an infinitely many- 
times differentiable function of compact support, then for each I k we 
could find a g k e &£() such that 

(5.2 7 ) 

Therefore, the fact that ^ (R m ) = £ 2 ((R m ) would be established because 
we have 

g = Z C ’^ic e 

fc=l 

and for the arbitrarily chosen /eL 2 (lR m ) 

11/--*11 <ll/-*ll+ll*-*ll 

<11/-*11 + Z 1^111X4-^11 <«• ' 

/c« 1 
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Any finite interval I in U m is a direct product of m finite intervals in IR 1 . 
Consider a semiclosed finite interval 

( 5 . 28 ) I = Oi, b t ) X ••• X [a m , b m ) 

whose characteristic function can be written as 


( 5 . 29 ) 


m 

Xi( x 1 v, x m) = n [8( x k — a k ) — 8(x k — 


1 


&*)], 


where 6(x) is the step function 



for x < 0 
for x > 0 . 


Assume that we have at our disposal a sequence h^x), h 2 (x),..., x e IR 1 , 
of functions which are monotonically nondecreasing 

h n (x') < h n (x "), x* < x\ n — 1 , 2 ,..., 


infinitely many times differentiable in x , and such that 


where e n > 0 and 


h n (x) = 


0 

1 


for x < — e n 
for x ^ 0 , 


lim € n = 0 . 


An example of such a sequence is provided by 


*»(*)= ex P (- f” 2 ^ ) 

(l 


for x ^ — l[n 
for — 1 /n < x < 0 
for # > 0 . 


We have 


( 5 . 30 ) 



K{x) ~ %)| 2 dx ^ e w 


0, 


n —> +°o* 


A n (^ x m ) = n [*»(** — a k) — *«(** - **)] 

7c~ 1 


The function 
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obviously is infinitely many times differentiable and of compact support. 
It is easy to derive from (5.29) and (5.30) that 

lim f „ I Xl( x l x m) — K{ x -L x m)\ 2 d x 1 ••• dx m = 0. 

Thus, we have proved that a g k e ^°(R m ) satisfying (5.27) can always be 
found when I k is a semiclosed interval like (5.28). Since the difference 
between I in (5.28) and any other interval with the same end points as I 
is of Lebesgue measure zero, our result applies to any finite interval 
in W*. Q.E.D. 


5.5. The Initial-Value Problem 

The time-dependent Schroedinger equation (5.5) is of the first order 
in t. Hence given the initial condition at some time t — t 0 in the form 
of a twice differentiable function 0 (O) (r x ,..., r n ), there will be at most 
one solution i/j(r 1 ,..., r n ; t ) of (5.5) which satisfies the initial condition 

(5.31) 0(r x r n ; t 0 ) = 0<°>(r 1 ,..., r n ). 

In the case where gL 2 (R 8(w ” 1) ), 


¥r 0 Vi',-, Cl) = 'A < 0 ) ( r i..... r M ), 


is a bound state, i.e., 0 (O) e we have the result 


(5.32) 

X exp [i-RK -± (eV + (f - 4)] I $T> ^ , 
where (see Chapter III, §4.3 on Fourier transforms) 
r n ; t) = .r;_ t ; /), 


<W V | ^< 0> > = f y v *(r 1 ',..., rCr) C) , 

J m*3(W—1) 


rU) = (2^ <-i) 


is the solution of the initial value problem (5.31). In (5.32), , W 2 >•••} 

is an orthonormal basis (in general, countably infinite) of the space 
.#i ( ) n ~ 1) (spanned by all the eigenvectors W eL 2 ((R 3{n_1) ) of Hi) which 
consists of eigenvectors of : 


W - . 
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One can arrive at the conclusion that such a basis exists by a straight¬ 
forward generalization of the reasoning employed in §5 of Chapter I. 

By duplicating the proof of Theorem 5.2 in Chapter I, it can be 
established that if the series (5.32) is infinite, it has to converge in the 
norm, and that its limit has the property that 0(£ o ) — 0 O . The fact 
that (5.32) really satisfies the equation (5.21) can be checked by inserting 
(5.32) in (5.5) and differentiating term by term. Naturally, in this 
procedure we have to assume that the order of summation and differen¬ 
tiation can be interchanged—a very arbitrary, and sometimes incorrect 
assumption, when the series is infinite. However, it follows from the 
general theory of linear operators expounded in Chapter III that (5.32) 
really represents the general solution of the initial-value problem for 
bound states. 

Exercises 

5*1* Show that if there are no external forces acting on an n-particle 
system, then classically, the total force F = MR (R and M are given in 
(5.9)) acting on the center of mass R of the system is a zero vector. 

5.2. Prove Theorem 5.1, and justify each step in the proof by 
applying well-known theorems of the calculus. 

5.3. Show that as a consequence of the action-reaction principle 
V R V = 0 in the absence of external forces. 

5.4. Prove that the series in (5.32) converges in the norm, and that 

its limit ( 1 0 ) for t = t 0 is equal to ¥ / ^ 0) . 


6. Direct Sums and Tensor Products of Hilbert Spaces 

6.1. Direct Sums of Euclidean Spaces 

In this section we shall study two important procedures of constructing 
new Hilbert spaces from given Hilbert spaces. These constructions— 
the direct sum and the tensor product of Hilbert spaces—have a great 
variety of applications to problems in quantum mechanics. With these 
applications in mind, we shall not treat the most general concept of 
such spaces that appears in the mathematical literature, but rather 
restrict ourselves to the level of generality which is sufficient for treating 
the mathematical problems to which nonrelativistic quantum mechanics 
gives rise. Such applications will occur frequently in §7 and in 
Chapter IV. 
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Theorem 6*1 ♦ Let , <^ 2 be a finite or countably infinite 
sequence of Euclidean spaces, with inner products <■ | , <-1 *> 2 ,...,respec¬ 

tively. The family of all sequences 

/=(/l,/.,-)G^X^X - 

for which 

(6.1) 'Yj (fk I fk)k < + 00 

k 

is a Euclidean space—customarily denoted by 

©** = *!© *2® - 

k 

and called the direct sum of ^ ^ r which the vector operations 
and inner product of /, £ e @ fc , / = (f x ,/ 2 g = (gi, g 2 ,-)» are 
defined in the following way: 


/ + £ —- (/i +£i j/2 + £2 >•••)> 

(6.2) af = (a/i , af 2 ,...), 

</1 £> = Z </* I !'»>* • 

k 

Sequences (/1 ,/ 2 ,•••) satisfying (6.1) are denoted by f x ®/ 2 ® ••• . 

Proof ‘ If we have a finite number of Euclidean spaces ^1 ,.••> , 

then each element/of ^ x ••• X obviously satisfies (6.1) and there¬ 
fore / is also an element of © • • • © <g n . It is a straightforward task to 
check that / + £, 0 / and </ | £> are defined for all /, g e ^ ® • • • © 
and scalars a, and satisfy all the axioms for vector operations and inner 
product, respectively. 

If we have a countably infinite number of Euclidean spaces , <^ 2 
we have to check first whether the expressions in (6.2) are really defined 
for all /, g e ® fc=1 $ k . Since we have 

t II «/* II* 2 = I« I 2 £ II h IIA Wfk Ik = «/* IA» 1/2 > 

fc=l fc=l 

it follows that af e @ k= i $ k whenever / e ©£ =1 & k . 

In order to establish that / + g e @ fc=1 when /, £ e ® fc=1 ^ , we 
apply the triangle inequality to the space ©£ =1 S k to obtain 

( v A 1 / 2 / v A 1 / 2 / v A 1 / 2 

S II/* + £* life) II/* lift) + (Xll£*llfc) * 

*-1 / \*-i / \*-i / 
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From the above inequality we can immediately conclude that 
f + ge ©*Li &k by letting v -* QO. 

In order to show that 

(6.3) £<Alft>* 

fc=l 

is absolutely convergent (and therefore also convergent) whenever 
(A ./a.-) and (ft, ft,...) belong to ©* =1 £ k , we use the Schwarz- 
Cauchy inequality in each of the spaces $ k to obtain 

(6.4) t KA I ft A I < i II/* II* II lift • 

1 1 

Since || A Hi»-, IIA II, and || ft Hi ,•••, II ft lb are numbers, by applying 
the Schwarz-Cauchy inequality to l\v) we get 

v r v "i 1/2 r v ~i 1/2 

(6.5) I IIAIIftllftll* < I HAIL 2 £ liftII** • 

k =i L&=i J L&=i J 

From (6.4) and (6.5) we can obviously deduce the absolute convergence 
of (6.3). 

Thus, we have established that the expressions (6.2) are really defined 
for every/, g e @ k=1 . We leave to the reader the straightforward task 

of checking that all the eaxioms of a Euclidean space are fulfilled when 
in addition we take the sequence (O x , 0 2 ,...) of the zero vectors O x , 0 2 ,... 
of , (p 2 respectively, to be the zero vector of . Q.E.D. 

As examples of direct sums of Hilbert spaces, note that l 2 (m + w) is 
isomorphic to l\m) © l 2 (n), and that 

/ 2 (+oo) = / 2 (1) ® / 2 (1) @ ••*. 

6.2. Separability and Completeness of Direct Sums 
of Hilbert Spaces 

In proving the next two theorems, we shall use the same method as 
in proving the separability and completeness of / 2 (® oo) in Theorem 4.3 
of Chapter I. 

^Theorem 6.2♦ If , S 2 ,... is a finite or infinite sequence of 
Euclidean spaces, their direct sum @ /c S k is a separable Euclidean space 
if and only if each of the spaces ,... is separable. 
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Proof. We shall consider only the case where we deal with an infinite 
sequence , S 2 ,••• of Euclidean spaces. 

Let S 1 , S 2 ,... be countable everywhere dense subsets of S x , S 2 
respectively. Now, all sets of the form 

R v = S x x ••• x S v x {0 V+1 } x {0 y+a } x ••• 
are countable subsets of ©* =1 <t> k (0 & is the zero vector of Hence 

R= 0^ c ®^ 

v=l k=1 


is a countable set, which is also dense in @ k=x ^k • Namely, for any 
given / = (/ x ,/ 2 ,...) e @ fc=1 <? k and € > 0, there is a positive integer n 
for which 

t II/* II* 2 < e /2- 

&=n+l 

Furthermore, there are g x e ,..., g n e S n such that 

II/* — £*IIJ < e /2«, & = 

because S k is dense in $ k . Therefore for 

g = (gi >•••> , 0 W+1 ,0 n+2 ,...) g R n 


we have 

n , oo 

II/-*11* = </-*!/-*> = Z II/* - ft II 2 + I ll/*ll 2 <*- Q-E.D. 

k =1 &=w+l 

^Theorem 63* If ,... is a finite or countably infinite 

sequence of Hilbert spaces, their direct sum © fc 2/f k is also a Hilbert 
space. 

Proof. We shall consider explicitly only the more difficult case of an 
infinite sequence of Hilbert spaces ,... . 

Lct/ (l) ,/ (2) ,... g ©fc =1 ^r fc , 

/W _ (/W,/W ...), 

he a Cauchy sequence in @ k= i k . Since 

ii/i" 0 ./i n) ii* [£ n/l m) -/i n) irf' 2 = n/ (m) -/ (n) ii, 

L ^i J 
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the sequence * s a Cauchy sequence in ^ k , and has a limit 

f k e M ? k , because M ? k is complete. 

By using the triangle inequality in l 2 (v) we can derive that 

r v "i 1/2 r v "ii/2 r v ni/2 

(6.6) E IIa -ft* II, 2 < E IIA -ft* II, 2 + E II/i m) -/1 M> II, 2 

Lk=i J L&=i J L&=i J 

for each m — 1, 2,... . Since/ (1) ,/ (2) ,... e @ ks=1 $? k is a Cauchy sequence, 
for any e > 0 there is an integer iV(e) such that 

t ll/i m) -/^ II, 2 < ll/ (m) II 2 < e 2 /4 

k =1 

for all m, n > iV(e). Furthermore, since/ fc — lim m ^ 00 /J. m) for each fixed 
positive integer, there is an integer N v (e) such that 

IIA — ft*\\k < 2- (i:+2)/2 e, k = 1, 2,..., V, 


for m > iV(e). Thus, from (6.6) we find that 



for all n > N v (e). By letting v —> oo in the above relation we derive for 
any n > N(e) 

(6.7) £ ||A -ft* II, 2 < * 2 - 

k =1 

The triangle inequality in J^ ? 1 © ••• ® J(? v yields 



* , , 11/2 r * . , 11/2 

E IIA — ft* II , 2 + E ll/* B> H,® • 

-fc= i j L&=i J 


From the above we can immediately infer that 

£ II/, II , 2 < 

k= 1 

by choosing a fixed n = n 0 > iV(€) and employing (6.7). Thus, 
/= (A ,/ 2 »•••) e ®,=i , and b y (6-7),/= lim lt _, 00 / (?t) . Q.E.D. 

It is easy to see that the operation of constructing direct sums of 
Euclidean spaces is commutative and associative (see Exercises 6.2-6.4). 
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6.3. Bilinear Forms on Vector Spaces 

In order to be able to study tensor products of Euclidean spaces, we 
need to familiarize ourselves with some of the properties of bilinear 
forms. 

Definition 6 A. A bilinear form (• | •) on a vector space* is a 
complex-valued function defined on Y' x 'V' y having the property that 
for any /, g, he 'V' and any complex number a 

(/1 ag) = a(f | g) = ( a*f \ g), 

(f\g + h) = (f\g) + (f\h), 

(f + g\h) = (f\h) + (g\h). 

A bilinear form (/ | g) is called Hermitian if for all /, g e Y' 

(J\g) = (g I/)* 

A Hermitian bilinear form is said to be positive definite if 

o 

for all fe i r . 

If (• | •) is a bilinear form, the function (f\f) on is called the 
quadratic form associated with (• | •) (see also Exercise 6.6). 

We note that an inner product on Y' is a positive-definite bilinear 
form for which (/ \f) = 0 only when / = 0. 

As an example of a bilinear form on l 2 (ti) we can take 

(6.8) (*\P) A = t a i* A tJ>k, « = l ] 

itk=1 \ a n 

where A is an n X n matrix 



An An 

A n i * * * A nn 

Anticipating the results we shall obtain in the next chapter (see 
Chapter III, Exercise 2.5), we can state that, vice versa, to each bilinear 
form (• | •) on l 2 (n) corresponds a unique matrix A in terms of which 



# Throughout this section we consider only complex vector spaces and on them (• | •) 
is sometimes called a sesquilinear form. All the definitions and theorems can be easily 
specialized to real vector spaces by limiting the field of scalars to [R 1 . 
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(a | ]3) can be written in the form (6.8). Furthermore, (• | •) is Hermitian 
if and only if the matrix A is Hermitian. 

By replacing in the proof of Theorem 2.2 in Chapter I the inner 
product </1 gy by a positive-definite bilinear form (/1 g), we arrive at 
the following conclusion. 

Theorem 6*4* If (• | •) is a positive-definite bilinear form on the 
vector space Y, then (/1 g) satisfies the Schwarz-Cauchy inequality 

\(f\g)\ 2 <(f\f)(g\g) 
for any two vectors /, g e Y. 

Theorem 6*5* If (f\g) is a positive-definite bilinear form on the 
vector space Y y the set 

*o = {/:(/l/)=0} 


is a vector subspace of Y. 

Proof. If /e Y 0 and a e C 1 , then ( af | af) = | a | 2 (/ |/) = 0, i.e., 
afeY 0 . 

Take any two vectors f,geY 0 . From Theorem 6.4 we get 
\(f\g)\ 2 <(f\f)(g\g)-o 
and therefore (/1 g) = 0. Consequently 

(f + g\f + g) = (/I/) + (g I/) + (/1 g) + (g I g) = 0, 

i .t.J + gsY 0 . Q.E.D. 

Lemma 6*1* Let Y be a vector space and Y x a vector subspace 
of Y\ then the family YjY 1 of all sets*/of the form 

f={f+h:heYJ, feY, 

is also a vector space—called the quotient space of Y and Y 1 — if the 
vector operations are defined in the following way: 

af=W + hy-heYJ, 

(6.9) 

/+! = {/ + * +*: 

and the zero vector 0 of i r ji r 1 is chosen to be 

(6.10) 0 = {0 + h: h e = Y'i . 


* Here,/is called the equivalence class of / modulo . 



6* Direct Sums and Tensor Products of Hilbert Spaces 


139 


Proof. It is easy to see that the above operations (6.9) are not depen¬ 
dent on the chosen representative from Y' of an equivalence class 
belonging to Y'\Y' X . For instance, if A = / 2 modulo Y x , i.e., 

A ={A+h:h e *3 = {f 2 + h: he = / 2 , 

then/ 2 e/i so that there is an Aj £ for which/! -f hi — A • Therefore 

fi + g = A + g ( m ° d ^i) 

because for any hen 

(A + g) + h = (A + g) + h', h! = h — h l9 

where ti e Y' x , since h, h x e and Y^ is a linear space. 

It is a straightforward task to check that the operations (6.9) satisfy 
all the axioms for vector operations if (6.10) is taken to be the zero vector 
in Y x . Q.E.D. 

Theorem 6*6* If (• | •) is a positive-definite bilinear form on the 
vector space Y', and Y Q is the vector subspace on which the quadratic 
form (/1/) vanishes (see Theorem 6.5), then 

(6.11) </!!> = (/1 £), /,|eir/Tr 0 , fgeY, 

is an inner product on the quotient space Y*/Y 0 . 

Proof. First, we have to establish that </1 g) is uniquely defined for 
every/, ge't'm, he., that when/! = / 2 (mod T^ 0 ) and^ = g 2 (mod n>)> 
then ( f 1 | ft) = (/ 2 | ft). To that purpose write 

(6.12) (A I ft) = (A “A 1*0+ (A I ft) 

= (A - A I ft) + (A I'ft - ft) + (A I ft)- 

Since (/1 g) is positive definite, we can use the Schwarz-Cauchy 
inequality (Theorem 6.4) to write 

I (A -A I ft)l 2 < (A - A IA - A)(ft I ft), 
l(A I ft - ft)l 2 < (A I AXft - ft I ft - ft), 

and then note that the right-hand sides of the above inequalities vanish 
because (f x — A IA “ A) = (ft ~ ft I gi — ft) = 0; thus > we g et from 
(6.12) that (A | g x ) - (/ 2 | ^ 2 ). 

It is easy to check that </|£> is a positive-definite bilinear form on 
Y/Y {i . Furthermore, if <(/1 /) = (/ \f) = 0 it follows that feY 0 , 
i.e., / YA , where is the zero vector of Y/Y 0 . This establishes the 
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theorem, since a positive-definite bilinear form whose quadratic form 
vanishes only on the zero vector is an inner product. Q.E.D. 

It is worth noting that in §4 we implicitly used Theorem 6.6 when 
we constructed the Hilbert space L 2 (Q, fi) from the vector space L( 2 )(Q, /x). 

*6.4. Algebraic Tensor Products of Vector Spaces 

Consider now the Cartesian product ^ X X i^ n of n sets 
,..., i r n , which are vector spaces. Introduce in x ••• X n P r n 
equivalence classes (see Exercise 6.7) by the identification 

(6.13) (ah™) X h™ X ••• X h <"> - h™ X (ah™) X - X h <"> = ••• 

- h™ x h < 2 > x ••• X (ah^) 


for all ae C 1 and all h (1) x ••• X h in) e ^ x ••• X 'V' n , and denote by 
(i r 1 X X the family of all such equivalence classes. We shall 

use the symbol h {1) ® • • • ® h {n) to denote the equivalence class con¬ 
taining h {1) X **• X h {n \ i.e., 


h™ ® ••• ® A< n > 

= {(^ (1) ) x ••• X (c n h {n) ): c x ••• c n = 1, q ,..., G C 1 }. 

Define in (i r 1 X ••• X ^ n )~ an operation of multiplication by a 
scalar a e C 1 , 

(6.14) a(/* (1) ® ••• ® h <">) = (oA^) ® ••• ® /* (w) . 

Note that if 0*. is the zero vector of i r k , then 

ah {1) ® ••• ® 0 fc ® ••• ® h {n) = h (1 > ® ••• ® 0 fc ® ••• ® h <"> 

for all a g C 1 , i.e., 

(6.15) 0 X ® /* (2 > ® ••• ® K n) = ••• = /* (1 > ® /* (2 > ® ••• ® 0 n 

for all h a) g i'l ,..., A (n) g . We shall call (6.15) the zero element of 
(n X ••• X ^ n )~, and denote it by 0. 

Consider now the family of all v-tuples (f x ,...,/„) of elements 
from X X ^ n )~ , and introduce in this family equivalence classes 
(see Exercise 6.8) by the following identifications: 


( 6 . 16 ) 


C/l (fki y'yfki) 
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for any permutation f kl ,...,f kp of f lt ...,f v e (V'i X ••• X 

(6-17) (/i./,,0) = (A 

for any f x X " X ^)~, and 

(6.18) (*«> ® - ® A<*> ® - ® *«»>, *«> ® - ® *«> ® - ® A<">, A ,...,/„) 

= (A™ ® - ® (A<*> + A<*>) ® - ® *<•>,/, ,...,/„) 

for any A,...,f v , h (1) (x) ••• (x) h tn) e X ••• X 'f r n )~- Denote the 
family of all such equivalence classes by Y'i (x) 0 • • • (x) a . Introduce 
in X ••• X T^)~ operations defined in the following way: 

(6-19) (A + (ft = (A ./ B , ft 

(6.20) *(A ..... A) = (*A ..... a/ w ), a e C 1 . 

It is a straightforward though tedious procedure to check that (6.19) 
and (6.20) leave the equivalence classes of i r 1 (x) a ••• ® a '^ r n invariant, 
and that they satisfy all the axioms imposed on vector operations. 

Theorem 6.7. The set ^ ® a ••• ® a i r n of equivalence classes 
constructed from elements of the Cartesian product X ••• X ^ of 
vector spaces ^ i^ n according to the equivalence relations (6.13)— 
(6.14) and (6.16)—(6.18) is a vector space (called the algebraic tensor 
product of ^ 7^) under the operations (6.19) and (6.20)—having 

as a zero vector the equivalence class containing (0), where 0 is the zero 
element of (^ X ••• X ^)~. 

Since we can obtain for any /= (A ,...,/„) e ^ (x) a ••• ® a i r n , by 
repeated application of (6.19), that 


(/i »-»A) = (A) + — + (A)> 

we can state another lemma. 

Lemma 6.2. The subset [i r 1 x ••• X ^)~ of ^ ® a •” ® a ^n 
spans the vector space ^ ® 0 — ® a ^ r n l every / e ^i ® a ®a^n 
can be expanded in the form 

/ = *(i> (x) - <g> A<"> + - + h?> ® - <g> h \w , 

(6.21) a«> E ^*i-> E . 

In the above relation we have simplified the notation by agreeing to 
write h = (A) for any x ••• X ^ n . 
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Assume now that the bilinear forms (• | | *) n are defined on 

the vector spaces i^ n , respectively. We can define a complex 

function (/ | g) for all /, g e ^ X ••• X i r , j in the following way: 

(6.22) (/«> ® - <g)/<*> I £ (1) ® - ® *«*>) = fl (/'*> I g m )k ■ 

k=l 

Note that (6.22) really determines a unique function on 
(T, X - X rx X (^ X - X n)~: 

((«/ (1) )®/ (2) ®-|^ (1) ®^ (2) ®-) 

= («/ (1) U (1, )i(/ <2> k (2, ) 2 ••• = (/ (1) I ^ (1, )i(«/ <2) k (2, ) 2 - 

= (/ (1) ® («/ (2) ) ® -1 g a) ®g {2) ® •••). 

By Lemma 6.2 we can write any f,g£ ^ ••• ® a i r n in the form 

/=i/r > ® -®/r > , 

(6-23) ^ 

£ = X 4 X) ® ®gf ) - 

j=l 

If we define 

Vl V2 

(6.24) (/ k) = X Z (/^ ® • • • ® /«»> 14 1 ' ® • • • ® *<*>), 

i=l j=l 

it is easy to check that the above expression does not change when any 
of the substitutions suggested by (6.16)—(6.18) are performed, i.e., that 
(6.24) extends the domain of definition of (• | •) to (x) a ••• (x) a i^) x 
(^i ®a ®a ^n)- Furthermore, it is very easy to verify that (• | •) is a 
bilinear form on ^ ® a ••• ® a i^ n . 

If the bilinear forms (• | ,..., (• | -) n are Hermitian, it is evident 

from (6.22)-(6.24) that (• | •) will also be Hermitian. 

We shall prove now that (• | •) is a positive-definite bilinear form 
whenever (• | -)i(• | -) n are positive definite. We need Lemma 6.3. 

Lemma 6*3* If (/i,/ 2 >—) is a finite or countably infinite system 
of vectors in a vector space then there is a countable system 
{e 1 , e 2 such that (e i | ej) = and each vector f t can be expanded 
in the following way: 


Ui 

fi ~ ^ “I - fi > 

i-1 


(fi I//) = 0. 


(6.25) 
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Proof. In the quotient space , 

^o={/:(/l/) =0,/e^} 

the bilinear form (/1 g) induces, according to Theorem 6.6, the inner 
product </|£> defined in (6.11). We can apply on {/i,/ 2 ,...} 
Theorem 2.4 in Chapter I to deduce the existence of a system , e 2 ,...} 
for which (j> i \ ef) — Sy and 

n i 

(6.26) /< = S «««; • 

1=1 


If e i is a chosen vector of belonging to the equivalence class , 

(6.26) implies, according to (6.9), 

ni 

fi “I ~ Si Z “I - hi ? hi , gi E . 

3=1 

We get (6.25) from the above by noting that// = h t — g i e ^ . Q.E.D. 

In order to show that (f\f) ^ 0 for any given ® a ••• ® a 1 r n 

when (/1 g) ± ,..., (/ | £) n are positive definite, expand/in the form (6.21). 
According to Lemma 6.3 we can find for each k — 1,..., n vectors 
e!** in terms of which 

Pie 

( 6 . 27 ) *<*> = Z 

i=i 

where (g (k) \ g\ k) ) k = 0, and therefore (f {k) \ g ik) ) = 0 for all f ik) e Y' k . 
Thus, we have 


(6.28) 


(/I/) 


V 


II 

M 

(A) 1 ' | 


•••(; 

^(w) 

i 

1 

i,j=l 





V 






II 

M 

Z " 

• z 

/Tf (1) ♦ 

a a) 

oh 

... a <»>*, 

K'n 

ij= 1 

i 1= i 

*«=1 




Pi 

Pn 

/*> 



\*/ v 

ll 

M 

- Z 

Z a «! • 

•• a 

«’ z 

h-1 


L \i=l 



/ \j=i 



0. 


In particular, if (• | ,..., (• | -) n are inner products in ^ ,..., 

respectively, then we have in (6.27) g {k) = 0*.. Consequently, from (6.28) 
we can deduce that (/1/) = 0 if and only iff = 0. 

In summary, we can state the following. 
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Theorem 6*8* If (• | ^ (• | -) n are bilinear forms on , 

respectively, then the complex function (/1 g) defined by (6.22)-(6.24) 
is a bilinear form on ^ ® a ••• ® a 1 r n \ (' I *) Is Hermitian, positive 
definite, or an inner product if all (• | ^ (• | -) n are, respectively, 

Hermitian, positive definite, or inner products. 

*6.5. Hilbert Tensor Products of Hilbert Spaces 

Definition 6*2* Let M P 1 ,..., 34? n be Hilbert spaces with inner 
products <• | •>! ,..., <• | -> n respectively. In the algebraic tensor product 
34?! ® a ®a^n °f >•••> ^ n denote by <• | •> the inner product 
which has the value 

(6.29) </|*> = A <f {k) \g {k) >k 

k=1 

for / and g of the form 

/ = /<!> (X) - g = £ (1 > 0 - 0 g in) \ 

(6.29) defines a unique inner product by Theorem 6.8. The Hilbert 
tensor product 34? x 0 ••• 0 ^? n of the Hilbert spaces ,..., n is the 
Hilbert space which is the completion of the Euclidean space $ with 
the inner product (6.29). 

Consider now two Hilbert spaces L\Q 1 , /x x ) and L%Q 2 , /x 2 ). Denote by 
fjL ± X /x 2 the product of the measures ^ and /x 2 (as defined in Theorem 2.4) 
on the Borel subsets of Q 1 X Q 2 • If / eL 2 (Q 1 , /x x ) and g eL 2 (Q 2 , /x 2 ), 
then f(xf) g(x 2 ) represents an element of L 2 (Q 1 x , Pi X /x 2 ), which 
we denote by / • g: 

(6.30) (/ • £)(*!, x 2 ) = /K) |-(x 2 ). 

Theorem 6*9* The linear mapping 

Iu-> ft, h e 3^[ ® a 3^ 2i fie 

(6.31) 

71 71 

A = Z ® 8k , & = Z . 

fc=l k=1 

of the algebraic tensor product of = L 2 ^!, /x x ) and Jf" 2 = L 2 (Q 2 , /x 2 ) 
into 3^ z = L 2 (fi! xfl 2 ,^x /x 2 ) can be extended uniquely to a unitary 
transformation of 34? x 0 34? 2 onto 34 ?%. 

Proof . It is easy to see that (6.31) is a linear mapping of ® a ^2 
into . 
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According to Theorem 4.8, the linear set 

! n } 

X a kXs t (x): S k = I k C\ Qi, a k e C 1 , k = 1,..., n, n = 1, 

(where Q 0 — Q 1 X Q 2 > an d are intervals in the Euclidean space 
containing the set Q t — 1,2) is dense in . Therefore (see Exer¬ 
cise 6.9), 'V'i ®a ^2 is dense in 3tf > 1 ®) a , and consequently also in 

® ^2 - 

The mapping (6.31) maps ® a ^ 2 into ^3 • Furthermore, for any 
Xr- > Xs . G we can write, if we denote by <• | •) the inner product in 
® & 2 , 

(Xr 1 ® Xr 2 I Xs x ® Xs 2 > 

= f Xtf^i) Xs&i) ^i(*i) f Xi? 2 (^) Xs 2 (* 2 ) dfi 2 (x 2 ) 

= I Xtf^l) XR&z) Xsf* 1 ) Xs.fo) X /X 2 )(X! , x 2 ). 

From the above we can deduce that if/i->/ and g g, then 

<f\g> = f /(•*) i( x ) 1 X ih)(x), f, g e ®a *1, 

J o 3 

i.e., the mapping (6.30) is an isomorphism oii r x ®i r 2 onto ^ . Since 
® ^2 is dense in J(? 1 ® and ^ is dense in , this isomorphism 
can be extended uniquely (cf. Exercise 6.10) to an isomorphism between 
M\ (x) and ^ 3 . Q.E.D. 

Theorem 6*10* The Hilbert tensor product ® ••• ®^ n of 
separable Hilbert spaces ,..., is separable; if {e^: ie U fc } is an 
orthonormal basis in the , then {e { ^ ® ••• ® e^: i x g U x ,..., i n g U n } 
is an orthonormal basis in ® ••• ® . 

Proof. If is separable, then there is a countable orthonormal basis 
{f?^: i G I!*.} in 34? k . The set 

T = {4? ® ••• ® ««r ); *1 6 Hii n e UJ 

is also countable. In addition, T is an orthonormal system: 

<*% ® - ® | @ ••• @ ej*>> = <*£> | ej?\ - <e£> | e%\ 

~ S <ih 8 W« • 
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In order to see that T is a basis in x (x) ••• (x) n , note that the linear 
manifold S k — (e[ k) , e !j fc) ,...) spanned by {e^\ } * s dense in 3ri? k , 

and therefore (x) a ••• , which is contained in the linear space 

(T) spanned by T, is dense (see Exercise 6.9) in (x) ••• (x) n . 

Q.E.D. 

In the next section we shall find useful the following generalization 
of the preceding theorem. 

Theorem 6*11 ♦ If {e { : i e U} is an orthonormal basis in the separable 
Hilbert space and {e ^ : j e 93} is for each value of the index i gU 
an orthonormal basis in the separable Hilbert space J^ 2 > then 
T — {e i (x) : j e 93, i e U} is an orthonormal basis in x . 

Proof . T is an orthonormal system because 

(X) Bi i j i | e^ 2 (X) | I I 2 

Let / be any given element of (x) • Since (x) e' lk : i e U, k e 93} 

is by Theorem 6.10 a basis in , for any given e > 0 there is a 


for which 


m 

g = £ <*<**< ® 

i,fc=l 


II/-*11 <*/2. 

As each T* = :/ e 93} is a basis in , we can find finite linear 

combinations YljLi ^ikj e ij approximating e' lk : 

n i / m 

\\ e lk ~Y H2 < 9 ( Y \ a ik\ 

j =1 ^ \i,k =1 

Consequently 

I . m rii / m 

/— S Z IE a ik b ikj 

1 i=l i=l \fc=l 

w II / w< Ml 

^ 11/ ~ g II + Y I a ik I e i ® \ e lk ~ Y ) < e * 

i.fc -1 11 N J -1 711 




Thus, (T) is also dense in (x) , i.e., [T] = (x) % . Q.E.D. 
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Exercises 

6*1 ♦ Prove thatL 2 ((— oo, 0]) ©L 2 ([0, -f oo)) is isomorphic to L 2 (U X ). 

6*2* Show that if $ x ,..., $ n are Euclidean spaces and k x ,..., k n is any 
permutation of the indices then S x © ••> @ is isomorphic to 

<? fci © • • ■ © <£ kn . Prove the same result for the countably infinite case, 
when instead of k x k n one takes an infinite sequence k x , k 2 ,... in 
which each positive integer appears once and only once. 

6.3. Show that the Euclidean spaces $ x © (<o 2 © <Q, S x © 
and {S x © <Q © <f 3 are isomorphic to one another. 

6.4. Show that (@* =1 <4') © (@1=1 4") and ©*=i 04' © <4") are 
isomorphic Euclidean spaces. 

6.5. Show that if the complex function (f\g) satisfies (/ j g + h) = 
(/1 £) + (/1 *) and (/1 <^) = a{f \ g), (f\g) = (g |/)*, then (/| £) is a 
Hermitian bilinear form. 

6.6 ♦ Prove that a bilinear form (• j *) is Hermitian if and only if its 
quadratic form Q(f) is real for all fe . 

6*7* Show that if we state that “A (1) X ••• X h in) ~g {1) X ••• X g in) 
if and only if h (i) — cg (i) , h^ ] = (1 /c)g (j \ for some cgC 1 and for a 
certain value of i and a certain value of j (i, j — 1,..., n),” then the 
relation ~ so defined is an equivalence relation. 

6.8. Prove that the “equality” relations in (6.16), (6.17), and (6.18) 
are equivalence relations in $ x X * • • X $ n . 

6.9. Prove that if S x and S 2 are dense vector subspaces of 3t? x and 
, respectively, then S x (x) a S 2 is dense in (x) a J ^ 2 , and therefore 

also in 3f x (x) Jtf p 2 . 

6.10. Suppose that Y' x and i^ 2 are dense vector subspaces of the 
Hilbert spaces and $f 2 , respectively. Prove that a linear mapping 
/•->/of ^ onto i ^ 2 , which preserves the inner product, can be extended 
uniquely to a unitary transformation of £F X onto &tf 2 . 

7. The Two-Body Bound-State Problem 
with a Spherically Symmetric Potential 

7.1. Two Particles Interacting via a 
Spherically Symmetric Potential 

When a system consists of only two particles of different kinds, the 
NtfitcH are represented by wave functions ifj(r x , r 2 , t) r eL 2 (IR 6 ), in 
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the six configuration space variables r x and r 2 . The time-independent 
Schroedinger equation is in this case 

/ h 2 h 2 \ 

(7 - 1} r ^i Ai + - r ^)) ^. *.) = E *(*i > r 2), 

where, on physical grounds, only potentials depending on r x — r 2 
exclusively are considered* (see §5). 

After going over to new coordinates, specified by the relative position 
vector r and the center of mass position vector R, 


(7.2) 


r = r 2 — r x , 


+ m 2 r 2 
m 1 +m 2 ’ 


we get as a special case of (5.18) 


Hs = 
(7.3) A r = 
A = 

where M 0 and 
of the system: 


h 2 
2 M 0 


a 2 

ex 2 


+ 



m 0 are, 


A --2|- A + ^ 

e 2 a 2 

a 2 a 2 

-gyf+ r = *e.+ye, + *c, t 

respectively, the total mass and the reduced mass 


(7.4) 


Mo = m i + m 2, 


mi m 2 

# m 1 + m 2 ' 

If we seek solutions of (7.1) of the form 


0(r 1 ,r 2 )=WR) , ?(r), 
we get as a special case of (5.20) and (5.21) 


(7.5) 


£'c‘MR), 

(7.6) 

(-5T d a + f < , '>)' p «' 

= E b W( r). 


* In phenomenological treatments of nuclear forces, velocity-dependent potentials are 
also considered. In that case the more general requirement of translational invariance 
is always imposed. 
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The equation (7.5) is uninteresting to us, because it is related, as we 
already know from §5, only to the motion of the center of mass of the 
two particles. 

Concerning (7.6), in practice we are most often faced with a spherically 
symmetric or central potential, i.e., a potential 

V{r) — F 0 (r), r — | r | = V x 2 + y 2 + z 2 , 

depending only on the distance r of the two particles. This considerably 
simplifies the problem. 

We shall treat this problem in the rest of this section in a detailed 
manner, because it provides a fair sample of the main methods used in 
solving the eigenvalue problem of the time-independent Schroedinger 
equation. It should be noted that in attacking this problem we will find 
that it is necessary to employ most of the main theorems obtained thus 
far. 


7.2. The Equation of Motion in Spherical Coordinates 
Let us write (7.6) in the form 

(7.7) AW(r) + (2m 0 lh*)(E - V 0 (r)) W(r) = 0, 

where we have simplified the notation by leaving out the subscript b 
in the internal bound-energy eigenvalue E h . 

It is natural to try to solve (7.7) by going over to the spherical 
coordinates r, 0, 0, 

(7.8) x = r cos 0 sin 0 , y = r sin 0 sin 0 , z — r cos 6 , 

in terms of which the Laplacian A becomes (see Exercise 7.1) equal to 


r 2 


/ 2 d\ ,_ d_ 

.dr \ dr) sin 6 dO 




almost everywhere, i.e., with the exception of the z axis, which is a set 
of zero Lebesgue measure. 

If we express the function/(#, y, z),feL 2 (R 3 ), in terms of the spherical 
coordinates r, 0 , <f >, 


(7.9) /p(r, 0 , (f >) = f(r sin 0 cos </>, r sin 0 sin </>, r cos 0 ), 

we note that f v (r, 0> </>) is defined on the closed subset Q v of 1R 3 , 

(7.10) - {(r f 0,0): 0 ♦ r < | oo, 0 < 0 < tt, 0 < 0 ^ 2tt}, 



150 


II* Measure Theory and Hilbert Spaces of Functions 


and satisfies the periodicity condition 

(7.11) f v (r, 6 , 0) = / p (r, 6 , 2v). 

It is well known that the Jacobian of the transformation 
(*, y, z) -* (r, 6, <f>) is 

J(r, 0,</>) = ** sin 0. 

Thus, if we introduce the Lebesgue-Stieltjes measure 

(7.12) /*pCB) - f r 2 sin 0 dr dO d<j>, B e & a , 

J B P 

on the Borel subsets of Q v , then by Theorem 4.9 we have 
f \f(x,y,z)\ 2 dxdydz= f \f p (r, 0,<f>)\ 2 dix p (r, 


i.e.,/ p eL 2 {Q v , /x p ). It is easy to see that, in fact,/—>/ p is an isometric 
linear mapping of L 2 (Q) into L 2 (Q V , £i p ). Furthermore, the range of this 
mapping is L 2 (Q V , /z p ), because if / p (r, 0 , </>) eL( 2 )(Q v , /x p ) satisfies the 
periodicity conditions (7.11), then there is an feL 2 (Q) which is mapped 
into / p ; but since the sets {(r, 0, </>): 0 < r < -f oo, 0 = 7t, 0 < </> < 27t) 
and {(r, 0, </>): 0 ^ r < -f- oo, 0 < 0 ^ 77 , </> = 277 } obviously are of /x p 
measure zero, every equivalence class / eL 2 (Q v , /x p ) contains a function 
which satisfies (7.10). Thus, we can state Lemma 7.1. 

Lemma 7*1* The Hilbert spaces L 2 ([R 3 ) and L 2 (Q V , /z p ) are iso¬ 
morphic, and the mapping/—>/ p defined by (7.9) is a unitary trans¬ 
formation of L 2 ([R 3 ) onto L 2 (Q V , /x p ). 

We can now try to solve (7.7) by the method of separation of variables. 
We look for solutions of (7.7) of the form 

(7.13) V(r) = R(r)Y(0,4>). 

Thus, we arrive at the system of equations 


(7.14) 

(7.15) 


1 d 
sin 0 30 


^sin 0 


dY(0,<t>) \ 1 3 2 Y(0,<t>) 

30 / sin 2 0 3(f> 2 


A ¥(0,4), 


J_ d_ 

r 2 dr 


- 2 ^L\ + £ R{r) Ur)) R{r) = 0. 


dr ) 


We note that (7.14) is an eigenvalue equation for A, in which the 
potential does not enter. Therefore, we are able to investigate (7.14) 
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without knowing anything more specific about our two-body problem, 
except for the assumption that the potential is spherically symmetric, 
i.e., it depends only on r. 

On the other hand, the equation (7.15) for the radial part R(r) of the 
wave function contains F 0 (r), and its solution can be approached only 
when the potential is specified. We shall investigate at the end of this 
section the solution of (7.15) for the Coulomb potential V 0 (r)=c/r y 
where c is a constant. 

Consider the sets 

Q r — [0, +°o) C [R 1 , 

Q s = [0,77] x [0, 2 tt] C [R 2 , 

and the Lebesgue-Stieltjes measures /x r on 38^ and /z 8 on 38 defined by 


(7.16) 


Mr (B) = f r 2 dr, 

J B 

/x s (J5) = f sin 0 d6 d<j>, B e 38 % s . 

" B S 


If ^(r), W e L 2 ([R 3 ), is square integrable in the Riemann sense and if it 
can be written in the form (7.13), then we have 

f | ^(r)! 2 dr = f | R(r) Y(0 , <£)| 2 r 2 sin 0 dr d0 d<$> 

^ K 3 J 

/.00 /.7T «2 IT 

= R(r)\ 2 r 2 dr | | Y(d, ^)| 2 sin 6 dd d<f> 

J Q J o J 0 

= f \R(r)\ 2 d^(r) f | Y(0,4>)\ 2 d^(0, </>), 

J n T J n s 


which implies that R eL 2 (Q v , /x r ) and Y eL 2 (Q s , /x 8 ). More generally, 
by using Theorems 2.4 and 6.9 we can infer the following result. 

Lemma 7*2* The measure space (Q v , 38 , /x p ) is the Cartesian 
product of the measure spaces (£? r , 38 , /x r ) and (Q s , , /x 8 ), and 

therefore the Hilbert space L 2 (Q V , /x p ) is isomorphic to the tensor 
product L 2 (Q t , /x r ) ® L 2 (Q 8 , /x 8 ) of the Hilbert spaces L 2 (Q T , /x r ) and 
» Ms)' 


7.3. Spherical Harmonics on the Unit Sphere 

We note that since 0 and </>, ( 0 , </>) e Q B , can be considered to provide 
a system of orthogonal coordinates on the unit sphere in [R 3 , we can 
al ways visualize a function Y(0 , <ji>) for which 


(7.17) 


Y(0 9 0) = F(0, 2tt) 
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as being defined on the unit sphere. A special class of function of the 
above type are the spherical harmonics 


(7.18) 


5T(M) = (-i) m [■ 


21 + 1 (l 


4)11 


1/2 






l — 0 , 1 , 2 ,..., m — —/, —/ + 1 ,..., +/, 


where Pf(u) are the associated Legendre functions of the wzth order,* 
which can be given by Rodrigues' formula : 

(1 _ M 2\m/2 d l + m 

(7.19) PT{u) = — (#• - 1)*, m = 0, 1, 2,.... 


The complete solution of the eigenvalue problem for (7.14) results in 
the following important theorem. 

Theorem 7*1 ♦ The spherical harmonics (7.18) are eigenfunctions 


of (7.14) 

1 3 


(si 


sin 6 


wr(e,<t>) 


)+4 


d 2 Y?(e,<l>) 


sin 0 80 \ 80 1 sin 2 0 8<f> 2 

and they determine an orthonormal basis 


-1(1 + 1) YT(0,4), 


X = {Fr, / = 0, 1 , 2 ,..., m = -l, -l + 1,.., +/}, 

(7.20) 

l»7T , 

I f (YT(&, 4))* Y?(0,4) sin 0d0d<f,= , 

J 0 J 0 

in the Hilbert space L 2 (Q 8 , /x 8 ). 

We shall carry out the proof of Theorem 7.1 in a few stages, thus 
being able to state and prove a few results which are of independent 
interest. 

In trying to solve (7.14) by the method of separation of variables, we 
look for solutions of the form 

(7.21) y(M) = ®(0)«K# 

Thus, we arrive at the following system of independent differential 
equations: 

(7.22) + v-m = 0, 

(7.23) sin 0 ~ (sin 0 - (A sin 2 0 + p) 0(0) = 0. 

* The associated Legendre functions Pi°(u) of the zero order are called the Legendre 
polynomials. 
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By applying again Theorems 2.4 and 6.9 we get Lemma 7.3. 

Lemma 7 3* The Hilbert space L 2 (Q B , /x s ) is isomorphic to 
L*([0, 7 t], jx e ) (x)L 2 ([0, 277]), where 

jjL 6 {B) — f sin 6 d9, B e ^\ 0 , 2n ) . 

J B 

1A. The Completeness of Trigonometric Functions 

When the periodicity condition (7.17) is imposed on (7.21), it becomes 

(7.24) 0(0) = 0(2tt). 

The only linearly independent solutions of (7.22) satisfying (7.24) are 
e im + 9 m = 0, +1, +2,..., 
and they do belong to L( 2 )([0, 27 t]). 

Theorem 7*2* The family {0 m : m = 0, ±1, ±2,...} of elements of 
L 2 ([0, 2tt]) corresponding to the functions 

= (1/V2tt) e*™* 

is an orthonormal basis in L 2 ([0, 2tt]), 

The fact that {0 O , 0 ±1 ,...} is an orthonormal system is easy to 
establish: 

1 c 2n 

I #m'> = 2 ^ J ex PD‘K “ • 

In order to prove its completeness we need Theorem 7.3. 

Theorem 7 3* If/(</>), </> e [0, 27t], is a continuous periodic function 

(7.25) /(0) =f(2n), 

with a piecewise continuous first derivative, then 

(7.26) m = lim4- t <*» l/> 

oo V ZlT m=- n 

and the convergence of o-^), o- 2 (</>),..., 

v -477 m—w 


to/(</>) is uniform in [0, 2 tt]. 
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Proof. Since /' eL 2 ([0, 2 tt]), /'(</>) = and {<2> 0 , <P ±1 ,...} is 

an orthonormal system, we have by Bessel’s inequality (see Chapter I, 
Lemma 4.1), 

+ 00 

(7.27) £ |<# m l/')l 2 < II/' II 2 - 

m=—oo 

On the other hand, integrating by parts we get 


<®»i/'> = -4- Crwe-^dt 

V2tt j o 

= ^ /['/(A dt = rn<4> m |/>. 


Therefore, by (7.27) 

+00 

(7-28) £ m 2 |<4> m |/>| 2 < ||/' |! 2 . 


To prove the uniform convergence of 

+n 

a ni$) X! ^m^m(^)j == m I /*X 

m=—n 

use the Schwarz-Cauchy inequality in / 2 (2v) to obtain 


°n+v{4>) ~ °n(4>) I 2 = £ Cm<P m (<j>) 

' |m|=n+l 


^ $ (6) |2 
= £ (mc m ) mW 

\m\=n+l 


m 


< ( I 

\|m|=n+l / \|m|=ra+l / 

1 W+l> 1 

< 5 ii/'i i i 


|m|=w+l 




where the last inequality follows from (7.28). Since the series 1 /w 2 
converges, for any € > 0 we can find an N(e) such that 

n+v i oo * 

S I = s<2«||/'||-« 


|ra|—n+1 


ra—n+1 
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for all n > N(e), and therefore the uniform convergence of or n (x) follows. 
In order to show that a^), a 2 (</>),... converges to/(</>), note that 


l n 2 tt 

o n (</>) = 2 ^ S exp (im4>) j f(r) exp(— imr) dr 

1c 2 ”-/” \ 

= 2 ^ J o /W ^ Z exp [»»(</> - r)]j dr 

1 r 2 ” 

= — /( T ) Re (i + exp[*(^ — t)] + ••• + exp[m(<£ — r)]) dr 

rt J 0 

= 1 Cf(r) Re ( eX P^+ 1 ^-T)|- . 1 _ \) dr 
7T J o W \ exp[)(</> — r)] — 1 2/ 

= J_ J a ”/( T ) Re ( 4 - exp[/(« + - T )1 - cxp[ - r)] 


sin \(<f> — r) 


-«) 


dr 




J »2ir—<t> 


-hi 


where in the last integral we have introduced the new variable co = r — </>. 
We extend the domain of definition of/(</>) to the entire real line: 


+ 27m) =m. 


± 1 , ±2,... . 


The function so extended is periodic (with a period 2tt), everywhere 
continuous, due to (7.25), and has a piecewise continuous derivative. 
We can now write 

z , v 1 r 2jr ,,, , v sin(« + i)o> , 

== 2 7 /J 0 /W+ ° j) 1bW2) dw - 

Since we have 

1 r 2n sin(w + J)o> 


/»<S77 

27tJ 0 sin(o>/2) 

we deduce from the above two formulas that 

1 


dw 


1 r 2,r 
7T J o 


COS CO 


cos na>) doj = 1, 


(7.29) 


*«-/»> 




cu/2) 

i f 2 - //(^ + w) -m 


2")co doj 


2tt 




cos(a>/2)j si 


sin(o>/2) 

1 r 2n 

I + <«) — /(</>)] COS »o) dw. 


sin wco dw 



156 


II* Measure Theory and Hilbert Spaces of Functions 


where the function 




sin (coj 2) 


is piecewise continuous, since 


lim g^w) = lim ---------— cos (^/2) = 2 

a>-*±o 61v ' a>-*±o o> sin(o>/2) v 1 9 J vr y ’ 


while 


£a(") = /(<£ + —/(") 


is obviously continuous. 

Thus, both functions £i(co) and £ 2 ( w ) are integrable and we have 


1 c 2n 

(7.30) o n (<j>) -/(</>) = ^~ J ^ [«“"(^ 2 (o J ) - ^(a.)) + e~ inw (g2(oj) + ig-^Wu 


2(2ir) 1 / 1 


i<®-« I £2 - *?i> + 


1 


2(2tt) 1/2 


<<P» I ga + igi>- 


Since on the right-hand side of (7.30) are the Fourier coefficients of 
functions belonging to L 2 ([0, 2 77]), from Bessel’s inequality we can infer 
that they converge to zero when n —> + 00 . Q.E.D. 


It is easy to finish the proof of Theorem 7.2 with the help of 
Theorem 7.3. 

If / is an arbitrarily chosen element of L 2 ([0, 27 t]) and € > 0 is any 
given number, then by Theorem 5.6 there is a g 1 e ^(IR 1 ), which can 
be represented by an infinitely many times differentiable function 
giix), x £ (R 1 , such that 

«+°° 

II/1 - gi 111 2 = I AM - giMi 2 dx < € 2 / 9 , 

J —00 


where 



0 ^ X ^ 277 
x ^ [0, 27t]. 


If we introduce the function 



0 X ^ 277 
x $ [0, 2?r], 


which is infinitely many times differentiable in [0, 27 t], we have 


/» 2 tt /•+ 00 

(7.31) 11/ -£||*= I I m - SMI 2 # < f I AM - £iMI 2 * < * 2 /9. 

0 ~aj 
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We can approximate £(</>) in the mean as closely as desired by a 
continuous and piecewise differentiable function /*(</>), <f> e [0, Iv], which 
is periodic, i.e., h( 0) — h(27r). For instance 

( g(<f >), </>0 <<f><27T, 

is such a function, for which 

(7.32) || g — h ||» = f*° I g(2n) + s(4>o)~g(2n) ^ _ g{(/>w d4> < e2/9 

J o 9o 

if cf )o is chosen sufficiently small. Since h{<j >) satisfies the conditions of 
Theorem 7.3, for sufficiently large n we shall have 

\K4>)~ £ cM<j>)\< elVWr, c m = <0 m | h\ 

m=—n 

and consequently 


d<f> < e 2 /9. 


n 2 r 2n i ™ 

(7.33) h - £ c m <P m = j *(*)-£ cMt) 

m=—n 0 m=—n 

By combining (7.31), (7.32), and (7.33) we deduce that 

n || j. n II 

£ e m 0 m <||/ —*||+||*-All + A- £ c m $ m < e, 

m=—n 1 , m=—n 

which establishes Theorem 7.2. 


7.5. The Completeness of Legendre Polynomials 

We can use Theorem 7.2 to prove the Weierstrass approximation 
theorem , which will be used later. 

Theorem 7 A (Weierstrass ). Any complex function /(#), xel, 
which is continuous in a closed finite interval I — [ a , b ] can be uniformly 
approximated in I by a polynomial P(x) in x; i.e., for any given e > 0 
there is a polynomial P(x) with complex coefficients, such that 

(7.34) |/(*) - P(x)\ < €, a < x < b. 

Proof. Consider the case when 0 < a < b < 2tt. Choose a partition 

0 — x Q < x t ^ a < x 2 < ••• < A? w -i = b < x n = 2 tt 
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of [0, 2 ?t]. The function 
x 

~f( a )> x 0 = 0 ^ x < — a 

[/(**) - /(**-1)]+/(**-1). 

x fc x fc-i 

x fc-i ^ x < x k , k = 2,..., w — 1 

— *n-l = * < * < 2 tt, 

represents a polygon inscribed in the graph of/(#) by joining the points 
(#! ,/( x i))v, (^n-i >f( x n- 1 )) with straight lines, and then joining the 
ends to the real axis at x 0 = 0 and x n = 277. Such functions £(#) can 
approximate uniformly and arbitrarily well the continuous function f(x) 
(recall that a continuous function is uniformly continuous in a closed 
finite interval). Since the function g(x) in (7.35) satisfies the conditions 
of Theorem 7.3, it can be approximated uniformly by a trigonometric 
polynomial 

n 

K X ) = Z C ™ e * mX - 

m=—n 



The above trigonometric polynomial can be, in its turn, approximated 
uniformly and arbitrarily well by polynomials, since due to Taylor’s 
formula 


_ imx 

— i + -jr + 


+ 


( imx) k ( imx) k+1 


k\ 


(* + W 


(cos mQ^x + i sin m0 2 x)> 


0 < @ 1,2 < 1 , 


each e imx can be approximated uniformly in [0, 277 ] by ordinary poly¬ 
nomials. Thus, the theorem is proved for the case when 0 < a < b < 2tt. 

The general case of [a, b ] can be reduced to the above case by mapping 
linearly [a, b ] into [0, 277 ], e.g., by using the mapping 


x — a ,77 

+ 2* 

and by noting that a linear mapping maps polynomials into poly¬ 
nomials. Q.E.D. 


We note that if (7.34) is true, then 


r /•& -i 1/2 - 

11/ - P || = [J I /(*) - P(*)l a dx\ <e Vb-a. 


Consequently, from Theorems 5.6 and 7.4 we can easily infer the 
following lemma. 
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Lemma 1A. The vector subspace ( p Q , p x ,...) of L 2 ([a , 6]), spanned 
by the elements of L 2 ([a , 6]) represented by the polynomials 

Po( x ) = Pi{x) ~ %>•••> Pn( x ) ~ 
is everywhere dense in L 2 ([a, 6]). 

We can turn now to solving the differential equation (7.23), which in 
the new variable 

(7.36) u — —cos 0 , —1 ^ u ^ 1, 

becomes (with fi — — m 2 ) the associated Legendre equation 

(1 ~ u2) iu I * 1 - U<1) ^*r\ ~ w 1 - “ 2 ) - m2 i = 0 

for the new function 

(7.37) w(u) = ©(arc cos(— u)). 

The above associated Legendre equation (called simply the Legendre 
equation when m — 0) can be written in the more customary form 

(7.38) (1 — u 2 ) w"(u) — 2u w'(u) — |a — —- ^ 2 j w(u) = 0. 

We can state the following direct consequence of Theorem 4.9. 

Lemma 7*5* The mapping © -> w, defined by (7.37), is a unitary 
transformation of L 2 ([0, 77 ], /x 0 ) onto L 2 ([—1, +1]). 

Theorem 7*5* The Legendre polynomials 

(7.39) = 1', / = 0, 1, 2,..., 

satisfy the Legendre equation with A — — Z(Z + 1): 

(7.40) (1 - u 2 ) P'\u) - 2uP'(u) + /(/ + 1) P*(«) - 0. 

The set {P 0 , P 1 ,...} CL 2 ([—1, 1]) is an orthogonal basis inL 2 ([—1, +1]), 
with 

( 7 . 41 ) f\(u)P l iu)du = 2 TfY 8 "'- ' 

Proof. To derive the result that Pi(u) satisfies (7.40), we start with 
the identity 

(I u 2 )(d/du)(u 2 ~ 1)' - 2lu(u 2 - \) 1 = 0. 
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By taking, with the help of the Leibniz rule, the / + 1 derivative of the 
above equation we arrive at (7.40). 

By employing the binomial formula we get 




1 » ( — 1)*/! d l 

2HI k\{l - k)\ du l 


u m-k) 


from which we derive, after carrying out all the differentiations, 


(7.42) 


Pi(u) = 


{1/ y~ S,) (—l) fc (2Z - 2k)\ 

& 2 l k\(l — k)\(l — 2k)\ 


_ (0, / even 

* ~ (1, / odd. 


The above formula shows that P x is a polynomial of degree /. Integrating 
by parts l times consecutively and using the fact that 


d k 

we arrive at the relation 


u=± 1 


0 , k = 0,1,...,/- 1, 


<P,IPm> 


2W C&V- 


du 


d l u m 

1 ^~du rdu ' 


From the above we get <P* | /> m > = 0 for m = 1,..., I — 1. Consequently, 
<P* | P r > = 0 for V < /, i.e., 


<p I I Pi-> = f 1 Pi(u) Pv(u) du= 0, / ^ /', 

J -i 

which proves that the system P 0 , P 1 ,... consists of mutually orthogonal 
elements of L 2 ([—1, +1]). It is easy to check (see Exercise 7.2) that 
(7.41) is true for l = i.e., the Legendre polynomials are mutually 

orthogonal but they are not normalized. 

From (7.39) it is obvious that p Q {u) = 1, p^u) = u,...,p n (u) = u n 
can be expressed in terms of P 0 (z/),..., P n {u). Consequently the vector 
subspace (p 0 , p 1 ,...) of L 2 ([— 1, +]]) spanned by p 0 , p t ,... is identical 
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to the vector subspace (P 0 , P 1 ,...) spanned by the elements of 
L 2 ([— 1, +1]) represented by Legendre polynomials. Since according 
to Lemma 7.4 ( p 0 , p 1 ,...) is dense inL 2 ([—1, +1]), we have 

[P„ , P x ,...] = [ Po , Pl ,...] = L\[- 1, +1]). Q.E.D. 


Theorem 7*6* The associated Legendre functions 


(7.43) PT(u) 


(1 — w 2 ) m/2 d l + n 


2H\ 


du l+ri 


(u 2 - iy. 


l = m,m + 


defined for all positive integral values of m, are solutions of the associated 
Legendre equation, 


(7.44) (1 - u 2 ) 


d 2 PT(u) 
du 2 




m 


1, 2, 3,... 


For a fixed integer value of m the set {P™ , P™ +1 , P™ +2 >•••} is an ortho¬ 
gonal basis inL 2 ([—1, +1]), with 

(7.45) <P," | P??> = £ P?(«) P?{u) du = S(j- . 

Proof. By substituting in the left-hand side of (7.44) 

we get by repeated use of the Leibniz rule 


d m 

dti™ 


[(1 - w 2 ) 


d*P t (u) 
du 2 


2w 


^»(«) 

du 


+ /(/+1)P,(ii)] 


= 0 . 


Thus, we deduce that (7.44) is true from the fact that (7.40) is true. 

It can be shown by direct integration that (7.45) is true, i.e., 
{P™ , P™ +1 ,•••} constitutes an orthogonal system in L 2 ([— 1, +1]). In 
order to prove that this system constitutes a basis inL 2 ([—1, +!])> note 
that for any given function/^),/eL 2 ([—1, +1]), we can write 

p+1 /7 m 

(7.46) <pr i/> = / (i - « 2 r 2 /(«) ^ p,(«) d«. 

f 

From (7.42) we can easily see that p 0 (w) = 1> Pi( u ) = “>•••> P„( M ) = u n 
can be expressed in terms of 

d m P m (u) d m P m+ i(u) d m P m+n (u ) 

du m ’ du m . du m 
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i.e., the vector spaces spanned by these two systems are identical: 


(Po,Pi >•••) - ( 


d m P m d m P„ 


du m ’ du m 
Thus, we deduce from (7.46) that the relations 




(7.47) =0, Z = m,m + 1,... 

imply that for h(u) — (1 — u 2 ) m ! 2 f(u) we have (g | K) — 0 for all 
g e (p 0 , p 1 Since according to Lemma 7.4, (p 0 , p 1 ,...) is dense in 
L 2 ([-l, +1]) and since obviously h eL 2 ([- 1, -f-1]) when/eL 2 ([- 1, +1]), 
there is a sequence g ± , g 2 ,... e (p 0 , p 1 ,...) which converges to h. Hence, 
if (7.47) is true, we shall have 

<h | h) = lim<^ | h} = 0. 


The above relation implies that h — 0, i.e., A(w) = 0 almost everywhere. 
Consequently, f{u) = (1 — u 2 )~ m ! 2 h(u) — 0 almost everywhere, because 
(1 — u 2 )~ m ! 2 is finite and different from zero everywhere in ( — 1, +1), 
while the Lebesgue measure of {—1, +1} is zero. 

Thus, we conclude that (7.47) implies that / = 0; consequently, 
{P^ , P ^ + 1 ,...} is according to Theorem 4.6(a) in Chapter I an orthogonal 
basis in L 2 ([-l, +1]). Q.E.D. 

7.6. Completeness of the Spherical Harmonics 

Now we are in the position to carry through to the end the proof of 
Theorem 7.1. Due to Lemma 7.5 we can deduce from Theorems 7.5 
and 7.6 that for each fixed nonnegative integer m the functions 

I®™ - [- 2/+1 '-»•"+>••••] 

determine an orthonormal basis in L 2 ([0, 77 ], /x 0 ). By applying Theorem 
6.11 to the orthonormal bases {0 m : m — 0, ±1,...} (see Theorem 7.2) 
and l = | m |, | m | + 1,...}, m — 0, ±1,..., we deduce that the 

system 

= 0, 1, 2,..., m = -/. -/ + 1,..., +/} 

is an orthonormal basis in L 2 ([0, 77 ], /x e ) ® L 2 ([0, 2tt]). From this result 
and Lemma 7.3 (see also Theorem 4.8 in Chapter I) we can finally 
conclude that the spherical harmonics determine an orthonormal system 
in L\Q S , fx s ). 

We can easily derive from Theorem 7.1 the following result. 
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Theorem 7 *7* The set 


{0, -2, —6,..., —/(/ + 1),...} 

contains all the eigenvalues A of the equation (7.14) belonging to eigen¬ 
functions Y(0 , <f >), ( 0 , <f>) e Q & , having continuous second derivatives in 
,Q S , and satisfying the following periodicity conditions: 


Y(0,0)=Y(0,2ir), 


d<l> <t>=o+ 1 ^= 277-0 * 


Denote by ^(Q & ) the set of all elements of L 2 (Q & , /x s ) which are 
represented by such functions Y(0, <f>). Any eigenfunction Y(0, </>), 
FG?p(fl g ), corresponding to the eigenvalue A= — /(/ -f 1) is equal, 
almost everywhere, to a linear combination of the spherical harmonics 


rr ! (0, 


Proof, It is easy to check explicitly by integrating by parts that the 
differential operator* form 


( 7,4S ) h * U sin e 89 ( Sln 9 8d) + sin 2 # 8<f> 2 

defines a Hermitian operator when applied to twice continuously 
differentiable functions Y(0, <f>), where Y e 

f ^i*(0, ^)(L 2 T 2 )(^, (f>) dfx s (0, <j>) , 

J o 
^s 

= f (VYiWO, 4) Y 2 (6, <j>) d^(d, 4>), Y lt Y 2 e C 2 P (G S ). 

J Q 
^s 

Consequently, we can infer, by the procedure used in proving Theo¬ 
rem 5.3, that any two eigenfunctions of — L 2 corresponding to different 
eigenvalues are orthogonal. 

If T(a)( 0, </>) is an eigenfunction of (7.14) 

(-L 2 F (a) )(0, t) = A h*Y U) (0, <f>), y w eL*(Q s , / x !B ;), 

we can expand it, due to Theorem 7.1, in terms of the spherical harmonics 

(7.49) y a = f £ <y ( “ i y w > vr- 

i —0 ra—-i 

* As we shall see in Section 7.8 of Chapter IV, the operator L 2 corresponds to the 
square of the angular momentum L of the system. 
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However, according to the above-mentioned property of eigenfunctions 
corresponding to different eigenvalues 

Wl y ( a) > = o, a^= —/(/ +1). 

Consequently, Y( A ) can be a nontrivial eigenfunction only if A is equal 
to —/(/ -f 1) for some / = 0, 1,... . In that case we are left in (7.49) 
with the finite vector sum 

+i 

IV-ZU+l)] = X C mX\ n, '> C m = I Y[_ i(i + i )]>. 

m=—l 

From the definition of equality of functions in L 2 spaces we get 
^t-iO+D#. 4>) = Ili~i c m Yf{6,<j>) almost everywhere in Q a . Q.E.D. 

This completes the systematic study of the angular dependence of 
internal energy eigenfunctions ^(r) of two-body quantum mechanical 
systems interacting via central potentials. We turn now to the study of 
the complete solution of the eigenvalue problem of Equation (7.7) for 
the special case of Coulomb (attractive) potentials 

V 0 (r) = -c/r, c > 0. 

7.7. The Two-Body Problem with a Coulomb Potential 

When we are dealing with the hydrogen atom, our two-body system 
consists of an electron carrying one elementary unit of negative charge 
—e, and a nucleus carrying the positive charge + e. The constant c is in 
this case equal to e 2 . We shall denote the positive constant c also in the 
general case by e 2 . 

Since we know that the only possible values of A in (7.14) are 
A = —/(/ + 1)> / — 0, 1, 2,..., the equation (7.15) for the radial part of 
the wave function assumes the form 


(7.50) 


d 2 Ri(r) 

dr 2 


2 dR t (r) r 2 m 0 E 1(1+ 1) , 2 m 0 e 2 

+ r dr + L 


dr 


h 2 


+ ■ 


h 2 


] Ri(r) = 0, 


where we introduce the subscript l in R\(r) to indicate that the solutions 
of (7.50) depend implicitly on the parameter /. 

If we look for negative energy solutions, E < 0, we can introduce 
the new variable 


(7.51) 


V—8 m 0 E 

p = -jj^-r 


and in (7.50) make the substitution 

(7.52) R t (r) = p l exp (— p/2) v t (p). 
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After some simple algebra we find that v t (p) has to satisfy the confluent 
hypergeometric equation 


(7.53) 


, d2 Vl(j>) I pi i o V d *l(p) (1 , , 

P-J-^ + (2i + 2 - -V + 1 



») ®i0>) = o. 


We shall attempt to find solutions of the above equation which can 
be written as a series of the following form: 


(7-54) v t (p) = p" £ a kP k = £ a kP k +\ 

&=0 ft=0 

If we insert the above series in (7.53) and assume that we can differen¬ 
tiate term by term, we get an expansion which has to vanish identically, 
and therefore the coefficients of this expansion have to be zero. By 
equating these coefficients to zero we get the relations 

(7.55) v{v + 2 l+l)a 0 =0, 

(7.56) ( k + v )(k + v + 21 + 1) a h — (k + v + / — n) a h _ 1 , k — 1, 2,..., 

which can be looked upon as recursive relations for determining 
a 1 , a 2 ,... . We can satisfy the indicial equation (7.55) by choosing v — 0. 
By solving (7.56), starting with a 0 = 1, we get as a solution of (7.53) a 
confluent hyper geometric function given by the Kummer’s series 


(7.57) F(l + 1 


— n | 2/ -|- 2 | p) — 


«(/-»+ 1) ...(/-» +A) 

*ti (2/ + 2)-(2/ + A + l) k\ * 


It can be verified (see Exercise 7.3) that the above power series has an 
infinite radius of convergence, and since the nth derivative of a function 
represented by a Taylor series can be obtained within the radius of 
convergence by differentiating the series term by term n times, (7.57) is 
indeed one of the solutions of (7.53). 

The equation (7.53) is an ordinary linear differential equation of the 
second order; consequently it has only two linearly independent solu¬ 
tions. A second solution of (7.53), which is linearly independent from 
(7.56), can be obtained by taking the solution v = —21 — 1 of the 
indicial equation (7.55). If l ^ 0, 1, f,..., we can derive from (7.56) 

the coefficients of the series (7.54) for this second solution. We note that 
this second solution behaves like when p —► 0. 
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Unfortunately, in our case l — 0, 1, 2,..., and (7.56) does not have 
any solutions because it cannot be satisfied for k~ —v — 21 + 1. 
However, a second linearly independent solution of (7.53) can still be 
obtained (see Bateman [1953, Vol. I, Section 6.7.1, p. 261, Eq. (13)]), and 
its behavior for small p is again ^p -21-1 . Consequently, R t (r) for this 
second solution [see (7.52)] behaves like p - * -1 for small p. Thus, for 
l — 1,2,... the corresponding function (7.13) is not integrable in the p, p 
measure (7.12) because of its behavior at p = 0, and therefore these 
solutions are ruled out. For l = 0 this solution provides a square- 
integrable function ^(r), but it is still ruled out on the following grounds: 
if a wave function i/* c (R) ^(r), with ’f'(r) behaving like 1 \r for r -> 0, 
would be included in the family of functions on which we apply the 
Schroedinger operator form H s [see (7.3)-(7.6)], then H s would not 
define a Hermitian operator (see Exercise 7.4)—a requirement which 
constitutes one of the postulates* of quantum mechanics. 

We return to the solution(7.57) of (7.53).If l + 1 — n${0, — 1, —2,...}, 
then all the coefficients in the power series (7.57) are different from 
zero. Therefore, when r->+oo, i.e., for p—>+oo, the function 
F(l + 1 - n) | 21 + 2 | p) in (7.57) obviously diverges faster than any 
finite power of p, and from the similarity of the Rummer’s series to the 
exponential series for e p y one can suspect that F(l -j- 1 — n | 21 + 2 | p) 
diverges about as fast as e p . In fact it can be shown (see Bateman [1953, 
Vol. 1, Section 6.13.1, p. 278]) that in this case the function 
F(l + 1 — n | 21 + 2 | p) behaves like p -1-1-71 ^ for large p. Consequently 

Ri(r) = p*exp(— pl2)F(l + 1 - n | 21 + 2 | p) 
has the asymptotic behaviour 

R i( r ) ~ P _w_1 ex P(P/2), p ^ +oo, 

which indicates that R l $L 2 (Q r , p, r ), i.e., the corresponding function 
(7.13) is not square integrable in that case. 

Thus, we are left only with the possibility that /+ 1 — ne{0, —1,..,, 
-n',...}. When 

(7.58) / + 1 - n = «' = 0, 1, 2,..., 

all the coefficients in the series (7.57) from the n' th on vanish, and the 


* See Chapter IV, §1. We shall analyze this problem more completely in §7 of that 
chapter. 
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series reduces to a polynomial* of the n 'th degree. In this case the radial 
function 


(7.59) 


Rm(r) = p l exp(— p/2)F(l +1 —n\2l + 2\p). 


P 


2 me 2 


n — /+!,/ + 2 ,..., 


is square integrable in the measure /x r . Furthermore, for each fixed /, 
l = 0, 1, 2,..., the set {R ni , n = / + / + 2,...} is an orthogonal system 

in L 2 (£ r , fi r ), 


R m(r) R n Jr) r 2 dr = ~ 


- / - 1)1(21 + 1)! 

(n + /)! 



S 


W7&i * 


Consequently, the corresponding wave functions 


/ ^2 \ —3/2r I /V *11/2 

(7.60) 

« = 1, 2,..., / = 0, 1, 2,..., n — 1, m — —/, —/ + 1,..., +/, 

constitute an orthonormal system inL 2 ((R 3 ); r) is an eigenfunction 
of the internal energy Schroedinger operator 

(7.6D 

and has as an eigenvalue 

(7.62) = n = l, 2,..., 

as can be seen from the definition of n in (7.53). It is intuitively obvious 
from the above procedure that the above values are the only eigenvalues 
of H { corresponding to square-integrable functions—as long as we 
restrict the domain of definition of H i by requiring that (7.61) defines a 
Hermitian linear operator on L 2 ((R 3 ). A rigorous justification of this 
statement requires some familiarity with the theory of linear operators 
on Hilbert spaces, and will be discussed in Section 7.7 of Chapter IV. 


# Equal to n'l L% l , + 1 (p), where L£ l , +1 (p) is an associated Laguerre polynomial, 


lUp) - (-D* 


d k 

dp k 


Ln'+kip )» 


itncl L v (p) is a Laguerre polynomial (sec Exercise 7.6). 
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Thus, we have arrived at the conclusion that the internal energy 
discrete spectrum of the hydrogen atom is 

_ ( m 0 e* m 0 e* m 0 e* ) 

p ~ \~~W 9 

In spectroscopy it is customary to denote the eigenfunctions corre¬ 
sponding to the above energy levels, in the above order, by the letters 
s, p, d y /, g,... . The numbers n, /, and m, are called, respectively, the 
principal , the azimuthal , and the magnetic quantum numbers. When an 
eigenstate of the hydrogen atom is given by writing one of the above 
letters s , p , d,... preceded by a natural number n, then n denotes the 
principal quantum number. Thus, 3 p refers to any of the eigenstates of 
(7.61) with energy E s , [obtained by taking n — 3 in (7.62)], which is 
also an eigenfunction of the angular momentum operator L 2 defined 
essentially by (7.48), with an eigenvalue /(/ -f 1), where 1=1. 

It is worth mentioning that the above terminology has become standard 
in atomic physics, and is used, with certain modifications, even in 
nuclear physics. It is applied not only to the hydrogen atom, but also to 
other atoms. This application to more complicated atoms with more 
than one electron in their electronic “shell” is made possible by an 
approximate model of such an atom. According to this model the 
complicated many-body interactions between each particular electron 
and the rest of the electrons are neglected, and each electron is envisaged 
to move in a Coulomb potential generated by the nucleus and all the 
rest of the electrons which are closer to the nucleus than the considered 
electron. This model, though too rough for quantitative analysis, 
provides an adequate qualitative picture allowing a simple classification 
of the electron “orbits” (i.e., eigenstates of the Schroedinger operator). 

According to Pauli's exclusion principle ,* there can be in this model 
at most 2 YaZq (2/ -j- 1) electrons in states *P n i m with a principal number 
n. All the electrons in such states are said to belong to the same “elec¬ 
tronic shell,” and when there are precisely 2^to (2/ + 1) electrons in 
a shell, it is said that the shell has been “filled out.” In such a “shell 
model” of an atom, the ground state* of the atom has the property that 
no electrons can be found in the nth shell if the (n — l)st shell is not 
filled out. 


* Pauli’s exclusion principle states that in an atom there are at most two electrons in 
a state specified by given n , /, and m. 

+ The ground state of a quantum mechanical system is the state in which the internal 
energy of the system is at its lowest possible value; e.g., the ground state of the hy¬ 
drogen atom corresponds to the principal quantum number n • ■ 1. 
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The above model leads in a natural way to the periodic table of 
chemical elements if it is postulated that the number of electrons in the 
electrically neutral atoms of a chemical element is equal to the atomic 
number (in the table) of that element. Many of the regularities in the 
chemical properties of the elements, which were first observed by 
Mendeleev in the 19th century, can be deduced from the above picture 
of the ground state of an atom. 


Exercises 


7*1 ♦ Check that for any twice-differentiable function /(#, y, z) the 
identity 


( 


a 2 

dx 2 



d 2 

dz 2 


-)/(*> y, *) 


r 2 



+ 


1 d 
sin 0 36 



1 3 2 i 

f( r sln 6 cos & r Sln 0 Sln & r cos 0 ) 


is satisfied, except for the points where 0 — 0 and 0 — 77 , i.e., for 

r = ze z , — 00 < 0 < -f°°. 


1.2. Show that 



1 r +1 r d\u 2 - 1)* I 2 _ 2 

— 4*(/!) a J L du l \ 21 + 1 


by integrating by parts l times. 


1.3. Prove that Rummer’s series (7.57) converges for every p e C 1 . 

1A. Denote by 3) the subset of all elements /eL 2 (^ p ,/x p ) [see 
(7.10) and (7.12)] which can be represented by functions/(r, 0, </>) which 
are twice continuously differentiable everywhere in Q v , except maybe 
for r — 0, and such that AfeL 2 (Q v , /x p ), where 


(0 

(Af)(r, 0, <f>) = J. _ _a_ / f(r, 6,<f>) \ 
( r 2 dr \ dr J 


for r = 0 
for r ^0. 


Show that 3 is a linear subspacc of L\Q V , /x p ) and that A is a linear 
operator (see Definition l.l in Chapter III) mapping 3 into L 2 (Q V , /x p ). 
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Prove that this linear operator is not Hermitian, i.e., 
f /*(', 0, </>)(Ag)(r, e , 4>) d^(r, 0,<f>)^ f (Af)*(r, 0, <f>) g(r, 0, </>) 0, <f>) 

Q $ Q g 

for some f, g e 

7.5. Find the internal energy spectrum and the corresponding 
eigenfunctions of a two-particle system, moving in three dimensions 
and interacting by means of a central square-well potential 

ys v _ (0 for 0 ^ r ^ L 

{r) ~ \ v 0 > 0 for L < r < +oo 

by solving the radial equation (7.15) with the above potential. Compare 
the present solutions with the one-dimensional case treated in §5 of 
Chapter I. 

Note. This problem is discussed by Messiah [1962, Chapter IX, 
Section II, pp. 359-361]. 

7.6. Prove by using the Leibniz rule for the nth derivative of 
p n e~ p that the Laguerre polynomials 


L n ( P ) = n\F(—n | 1 | p) 


(i + i (-i)‘ 


n(n — 1) ••• (« — k + 1)\ p h 


k\ 


\E1 
) kl 


= y (—i)* ^ -£? 

’ {n~k)\k\ k\ 

can be written as 

d n 

L n{p) = e ° J^{p ne ~ P Y 

Use the above formula to show that the set {/ 0 , l x ,...}, where 

I n (p) = (lln\)e-'»L n (p) 

are the Laguerre functions, is an orthonormal system inL 2 ([0, 

7.7. Derive from the results of Exercise 7.6 that 


I 


0 


L n ( P ) 


n\ 


ex p[ W(l - *)] 

1 - s 


s l 
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(the function on the right-hand side is called the generating function of 
Laguerre polynomials). Use this formula and the orthonormality of 
{/ 0 , to show that 

Jfe /” (rb exp h \ T=i‘ p ] ~ | 0 snlM f dp 



7.8. Prove that the orthonormal system {/ 0 , l x ,...} (see Exercise 7.6) 
is a basis inL 2 ([0, + 00)). 

7 .9. Prove that the system {h 0 , h x ,...} of Her mite functions defined 
by 

h n (u) = e~ u2/2 H n (u), 

where H n (u) are the Hermitian polynomials (see Chapter I, Exercise 
5.4), is an orthonormal basis inL^IR 1 ). 

References for Further Study 

On measure theory (§§1-3) consult Halmos [1950], and Munroe [1953]. A less general 
but very readable book is the work of Kolmogorov and Fomin [1961], which is also 
useful for §4. 

On spaces of square integrable functions (§4) see Riesz and Nagy [1955], and von 
Neumann [1955]. 

For §5 and §7 consult Landau and Lifshitz [1958], Messiah [1962], and Gottfried [1966]. 
A general treatment of tensor products of Hilbert spaces can be found in the work of 
Dixmier [1952]. 



CHAPTER III 


Theory of Linear Operators 
in Hilbert Spaces 


In Chapters I and II we met a large number of linear differential 
operators and other linear transformations. Linearity and a few other pro¬ 
perties (such as boundedness, self-adjointness, etc.) of operators, which 
can be formulated and studied in a general and abstract manner without 
having to specify the operators, imply that the operators possessing these 
properties have many other additional properties. These properties are 
derived in the theory of linear operators. The object of the next six 
sections of this chapter is the study of that part of the general theory of 
linear operators in Hilbert spaces which is of immediate importance in 
quantum mechanics. The spectral theorem, formulated and proved in 
§6, is the main result of this theory. 

A knowledge of the theory of linear operators in Hilbert spaces will 
enable us to study the structure of quantum mechanics from an abstract 
and very general point of view. In such a context the formalism of 
wave mechanics becomes a particular realization of the general formalism. 

L Linear and Antilinear Operators on Euclidean Spaces 

1.1. Linear and Antilinear Transformations 

A mapping of a vector space ^ into another vector space ^ is usually 
called a transformation of ^ into or an operator from into . 

It is evident from the preceding two chapters that a special class of 
operators, called linear operators, are of great significance in quantum 
mechanics. Besides linear operators there is a second class of operators 
which play a role in quantum mechanics, though a very minor one by 
comparison with the linear operators. These are the antilinear operators. 
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Definition 1*1* A mapping 

fer l9 f'efl, 

of the vector space i r 1 over the field F into the vector space over the 
same field F is called a linear transformation (linear operator in the case 
that* = ir 2 ) if 

A ( a f + h) = aA (f) + bA(g) 

for all a, b e F and all /, g e . For vector spaces and over the 
field C 1 of complex numbers, the mapping 

fer i9 f"e*T 2y 

is called an antilinear transformation + {semilinear transformation) if 
B(af+kg) = a*B(f)+b*B(g) 
for all a, b e C 1 and all /, g e . 

It is customary to denote the image A (f) of / under a linear or anti¬ 
linear transformation simply by Af 

We have met many linear transformations in the preceding chapters. 
For instance, the unitary transformations used in defining the concept of 
isomorphism of Euclidean spaces (see Chapter I, Definition 2.4) are linear. 

It is important to realize that the simplest type of linear operator is 
multiplication by a scalar a from the field F: 

f>->af, /ef. 

We shall denote the above operator simply by a . However, the reader 
should realize that there is a conceptual distinction between a as an 
element of the field, and a as an operator! 

As an example of an antilinear operator on the vector space ^(U 1 ) 
of continuous complex-valued functions on U 1 consider the operator C 
of complex conjugation 

(Q0(*) =/*(*), /e*"(IP). 

# This particular distinction between transformation and operator is adopted by us as 
very convenient, but it is by no means standard. It seems to be the case that, depending 
on the author, either the term “transformation” or the term “operator” is vsed exclusively. 

1 Note that an antilinear transformation is distinct from a linear transformation only 
for complex vector spaces, or, more generally, for vector spaces over fields F in which 
u * operation (or involution operation) is defined by a mapping a -> a*, a, a* e F , obeying 
the conditions that (a*)* “ a, a + b —► a* -f-6* and ab —► 6*a*. Since the case of 
complex vector spaces is of sufficient generality for quantum mechanics, we ignore the 
more general possibilities. 
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We have 

C(af(x) + bg(x)) = a*f*(x) + b*g*(x) = a*(Cf)(x) + b*(Cg)(x). 

1.2. Algebraic Operations with Linear Transformations 

Let A x and A 2 be linear transformations of ^ into i^ 2 . Then, as can 
be easily checked, the transformation A defined by 

Af=AJ+AJ , /eti, 

is also a linear transformation, called the sum of the transformations 
A ± and A 2 and denoted by A ± -j- A 2 . 

Let now A be a linear transformation of into i^ 2 , and B a linear 
transformation of 'V' 2 into ^ . Then it is easily seen that the transforma¬ 
tion C of ^ into ^ , defined by 

Cf — B(Af), fsr l9 

is also linear; C is called the product of B and A , and is denoted by BA 
(which is, in general, different from AB). 

Finally, if A is a linear transformation of the vector space into the 
vector space i^ 2 over the same field F, and a e F, the transformation 
aA defined by 

(aA)f = a(Af), /e^, 
is again a linear transformation. 

When we are dealing with the set 91 of all linear operators on a vector 
space over a field F, we see from the above definitions that for any 
A y B e 91 and a e F the operators aA, A + B and AB are always 
defined. These operations in 91 endow 91 with a certain algebraic 
structure. Such structures have been systematically studied—very much 
like the vector structure of vector spaces. Sets with such structures 
belong to mathematical spaces called associative algebras (to be distinguis¬ 
hed from Boolean algebras), or simply algebras. 

Definition L2* An algebra over a field F is a linear space 91 over 
that field on which, in addition to the vector operations, an operation of 
algebraic multiplication is defined. An algebraic multiplication operation 
is a mapping 

(A,B)h> AB, (A,B)e 91 x 91, AB e 91, 

of 9f X 91 into 91 having the properties 
(1) (AB) C = A(BC), associativity, 
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(2) (A + B)C = AC + BC , C(A + B) = CA + CB, distributivity, 

(3) a(AB) = ( aA)B = A(aB), associativity of scalar multiplication, 

for all A, B, C e 51, and for each element a from the field F. 

We say that the algebra 51 has an identity if there is an identity 
element 1 satisfying 

1A = A1=A 


for all A e 51. 

The algebra 51 is said to be commutative if AB = BA for all A, B e 51. 
It is called real or complex if the field on which it is defined is the field of 
real or complex numbers, respectively. 

It is easy to see that all the complex n X n matrices with matrix 
multiplication as the operation of algebraic multiplication form a 
complex algebra. 

Definition 13* Two algebras 5l x and 5I 2 over the same field F are 
isomorphic if there is a one-to-one mapping 

A 1 <->A 2 , A 1 eM 1 , A 2 e 5I 2 , 
which has the property that for any A ± , B ± e 5l x and a e F. 

A 1 + B 1 «—> A 2 + B 2 

aA 1 < -> aA 2 , a e F, 

A x B x ^>A 2 B 2y 

where A 2 , B 2 e 5I 2 are the images of A ± , B ± e 5Ij , respectively. Such a 
mapping is then called an isomorphism between 5l x and 5I 2 . If 5I 2 is an 
algebra of matrices (with the algebraic operations determined by matrix 
summation and multiplication) then the isomorphism is called a faithful 
matrix representation of 5I X . 

Theorem lA* The set 5l(Jf) of all linear operators defined on a 
real (complex) finite-dimensional Hilbert space is a real (complex) 
algebra with identity under the operations of operator' summation, 
multiplication and multiplication of operators by scalars f , with these 
operations defined, respectively, by the relations 

(1.1) (A +B)f=Af+Bf, A, Be 5l(^) 

(AB)f = A(Bf) 9 A, Be «(^T) 

(aA)f = a(Af), A e 5lpf), a e IR^C 1 ), 
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valid for all/e . If the dimension of ^ is n, and {e 1 e n } is any given 
orthonormal basis in the mapping 


(i.2) 


A 


a U *** a ln 
\&nl *** 


A e 


a tj = <e* | Aej), i,j = 


is a faithful representation of $I(Jby the real (complex) algebra of all 
n X n matrices; this representation is called the matrix representation of 
in the basis {e 1 e n }. 

Proof . It is a straightforward task to check that is an algebra 

with identity, if the identity is taken to be the linear operator 1 defined 
by the identity mapping 


/h>1 / = /, fetf. 

The matrix in (1.2) is obviously real if is a real Hilbert space; for 
a complex Hilbert space (1.2) is in general complex. If {e 1 e n } is an 
orthonormal basis, by taking advantage of the properties of an ortho¬ 
normal basis (see Chapter I, Theorem 4.6), we can write 

n n 

(1.3) Aej = £ < e < I Aej) e t = £ a«e< . 

i =1 i=l 

The above relation shows that the mapping (1.2) is one-to-one, since to 
every matrix corresponds only one linear operator A satisfying (1.3), 

(1.4) Af = A t I/> 1 l/> Ae } 

3=1 3=1 


= I <«,!/> (t <‘i \Aej>e) 

3=1 \i=l / 

= I (l <«<l ^><*I/>U, 

i=l \j=i / 

i.e., the image ^4/ under A of any vector / is completely determined by 
giving the matrix || a tj || in (1.2). 

If B e and || by ||, b = < e i | Bef), is the image of B under the 
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mapping (1.2), we get by using the linearity of A and B and the defini¬ 
tions (1.1) 

A + B ~ IK* \(A+B) *>11 = ||<* | Ae, + £*>|| 

= IIO< I Ae.y + <* I £*>11 - II a tj || + || b„ ||, 

AB i ||<* | 4£*>|| = ||(* | A £ <* | £*■> *A| 


/ W 

<«* I 


<«* I B*> Ae t 


= \(t< e * I B'iXfit I ^>)|| 

= a iJc^Jc})|| == II a ik II ■ II bki II, 


i.e., A + B and AB are mapped by (1.2) into, respectively, the sum and 
product of the matrices || a tj || and || b tj ||. Since 

aA ^ ||<«< | (aA) e ,)|| = || *<«, | Ae^W - a || ^ ||, 

we have established that (1.2) is an isomorphism. Q.E.D. 

We recall that every finite-dimensional Euclidean space is a Hilbert 
space. In general, in any given n-dimensional vector space 'V' we can 
easily introduce an inner product, e.g., by choosing a vector basis 
{/l ,•••>/«} C i r and writing for every/, geT, 


<f\g> = £ «»*&», 

fc=l 

where a x ,..., a n and b 1 ,..., b n are the coefficients of/and g, respectively, 
in the basis {f x 


n n 

f ==: X ’ £ — X! • 

fc=l fc-1 ' 

Consequently, the above theorem can be applied, de facto , to any finite- 
dimensional vector space. In fact, Theorem 1.1 can be proved directly 
for finite-dimensional vector spaces, without the introduction of an 
inner product (see Exercise 1.4). 



178 


III. Theory of Linear Operators in Hilbert Spaces 


1.3. Continuous and Bounded Transformations 

As the concepts of continuous functions and bounded functions are 
of great significance in the theory of functions, so are the analogous 
concepts which can be defined for operators acting on Euclidean spaces, 
or more generally, on normed spaces.* 

Definition l A* A transformation 

/k>T(/), f e JT X , Tf e , 

of the normed space into the normed space Jf 2 is said to be continuous 
at / 0 , / 0 e , if for any e > 0 there is an 8(e) such that 

II T(f) - T(f 0 )|| 2 < e for 11/ -/ 0 Ik < 8(e), 

where || h |k denotes the norm of h e j \ r i , i = 1,2. The transformation 
T is said to be continuous if it is continuous at all points / 0 e . 

Definition 1*5* A set S in a normed space is bounded if there is 

a constant C such that ||/|| ^ C for all feS.A transformation T of 
into is called a bounded transformation if it maps each bounded set 
in into a bounded set in Jf 2 • 

The requirement that a linear transformation T be bounded is obvious¬ 
ly equivalent to the requirement that there is a constant C such that 

WTfW^cwfWi 

for all /e A r 1 since || T(af)\\ 2 = | a | || 7y|| 2 for any a e C 1 . 

If A is a linear operator on a normed space then the supremum 

(1.5) MII = sup feJT lt 

is called the bound of the operator A ; obviously || A || ^0. 

Note that if A is bounded, then its bound || A || is finite. Vice versa, if 
|| A || < + oo then || Af\\ < c || A || for all ||/|| < c , i.e., A is bounded. 

For linear transformations we have an intimate connection between the 
concepts of boundedness and continuity. This connection is stated and 
proved in the second of the following two theorems. 

Theorem 1*2* If the linear or antilinear transformation T of the 
normed space into the normed space is continuous at one point / 0 
of * then it is continuous everywhere in ^i. 

* These concepts are defined in their most general form for topological spaces. The 
present definition is, however, sufficiently general for our purposes. 
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Proof. If T is continuous at / 0 e d\ , then for any e > 0 there is a 
8 (e) such that 

(1.6) II r(/ -/ 0 )|| 2 = \\Tf— Tf 0 || 2 < e 

for all II/ —/ 0 ||i < 8(e). For arbitrary £ 0 e .A\ we obtain 
II Tg - Tg 0 || s = || T(g-g 0 )\\ 2 < e 

for all || g — g 0 Hi < 8(e) by taking g — £ 0 = / — /„ in (1-6). Q.E.D. 

Theorem 13* A linear or antilinear transformation T of the normed 
space into the normed space dT^ is continuous if and only if it is 
bounded. 

Proof. If || T\\ < +oo, and 0 1 denotes the zero vector in dT x , then 

II “0,11, = ||7y|| a <e 

for all ll/ll! < §( e ) = 6 || T|| _1 . Consequently, T is continuous at the 
origin O x e , and by the preceding theorem, it is continuous every¬ 
where in dT± . 

Conversely, if T is assumed to be continuous at 0 1 , there is such a 
S 0 > 0 that || Tg || 2 ^ 1 for all || g Ih ^ S 0 . For any f s we have that 
g = S 0 || /Hr 1 / satisfies || g 1 ^ = S 0 and consequently 

Soll/lir 1 !! 2 / 112=11 ^( 8 0 |j/nr 1 /)ll 2 ^i. 

Thus, if ||/||i ^ c for any given c >> 0 , then 

II Tf ||, ^ c&q 1 , 


i.e., T is bounded. Q.E.D. 

1.4. Examples of Bounded and Unbounded Operators 

The study of operators on finite-dimensional Euclidean spaces is 
considerably simplified owing to the fact stated in Theorem 1.4. 

Theorem L4* Any linear transformations of a finite-dimensional 
Euclidean space ^ into another Euclidean space <f 2 is bounded. 


Proof. Let the subindices i 1,2 in <• | refer to the inner 
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products in and , respectively. If dim = n and e ly ... y e n is an 
orthonormal basis in S\ , we have for any / e 


(1.7) 


<Tf I Tf\ = (£ <e ( |/> x Te t £ <e, |/> x 7V, 


3=1 


= I </l ^>i<T>i | Te^iej |/> x . 

«',3==1 

Since according to the Schwarz-Cauchy inequality 

\< e i l/>i I II ^ 111 ll/lli = ll/lli, 

we get from (1.7) 

II 77II2 2 < ( t \< Te i I 1) ll/llx 2 . 

\*,3=1 / 

The above relation shows that || 7y|| 2 < CH/Hi for all f ^ A r 1 , which 
implies that T is bounded. Q.E.D. 

The above theorem does not apply to infinite-dimensional spaces. In 
fact, most of the operators of interest in quantum physics are unbounded. 
For instance, consider the space ^(R 1 ) of all continuous functions 
f(x), x g R 1 , of compact support. The operator A 

(Af)(x) = xf(x) 

is obviously linear, but it is not bounded. To see that, take a function 
m ^ 0 with support inside [0, 1] which does not vanish identically. 
For each a g C 1 the function f a (x) = f(x — a), has its support inside 
[a, a + 1], and f a (x) e ^(R 1 ). Since || f a || = ||/ 0 || = const, while 

/»+°° /»®+l 

II Af a || 2 = x 2 f 0 2 (x — a) dx — x 2 f 0 2 (x — a) dx 

J —co ^ a 

/»+°° 

3* a 2 I fa 2 {x) dx = a 2 ||/ 0 || 2 - w ^> +co, 

« —co 

we see that || A || = + 00 . 

Exercises 

1.1. Let a 0 (#),..., a n (x), x g C 1 , n ^ 1, be continuous functions. Show 
that the mapping 

f(x)>-*g(x) = a n {x) d - j l ~ -1 -4- a { (x) -|- « 0 (x), 

/e r <?"(R l ), ge.W\M l ), 
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is a linear transformation of the linear space ^ n (U x ) (of «-times 
continuously differentiable functions) into ^(IR 1 ). 

Remark. The above linear transformation can be characterized by 
the differential operator form 

d n d 

«»(*) dx™ + '" + ^ dx + 

and the specification of its domain of definition ^(IR 1 ). 

L2* Let / e/ 1 be a finite interval, and let K(x>y) be a continuous 
function on I X I. Prove that the mapping (Lebesgue integration) 

f{x) ~ (Kf)(x) = j' K(x, y)f(y) dy, f e L\I), 

defines a linear operator on L 2 (I). 

L3* Prove that the linear operator defined in Exercise 1.2 is bounded. 

L4* Let 'V' be a complex n-dimensional vector space, and/ x ,...,/ n 
a vector basis in Y'. Assign to each linear operator on 'V' the complex 
matrix || a ik ||, defined by expanding Af k , k = 1,..., w, in the vector 
basis/i 

n 

A fit X! * 

«=1 

Show that the mapping A || a ik || is an isomorphism between the 
algebra of all linear operators on the vector space 'V' and the algebra of all 
n X n complex matrices. 

L5* Prove that the linear operator 

(Bf)(x) = f(x)e^(n i), 

defined on the space ^((R 1 ) of infinitely many times continuously 
differentiable functions of compact support, is unbounded. 

L6* Denote by the algebra of all bounded linear operators 

acting on a normed space jV' (such as the Banach space L\Q, fi) of all 
/u-integrable functions on 13 ). Show that the operator bound || A ||, 
A € ^(./K ), defined by (1.5), has all the properties of a norm (see 
Chapter I, Definition 2.2) on W(./f’). 
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2* Linear Operators in Hilbert Spaces 

2.L Linear Functionals on Normed Spaces 

K functional on a vector space 'V' is a real- or complex-valued function 
defined on that space. As such, a real or complex functional can be 
thought of as a transformation of the vector space 'V' into the vector space 
(IR 1 ) or (C 1 ) of real or complex numbers, respectively. Hence, we can 
introduce the fundamental concepts of continuity, linearity, etc. by 
specializing the corresponding concepts, defined for transformations in 
general in §1, to the present special case of transformations. 

Definition 2*1 ♦ A functional </>, 

defined on the normed space Jf is continuous at the point / 0 e JT if for 
any € > 0 there is a 8(e) such that | -</>(/ 0 )| < e for all ||/-/ 0 1| < 8(e). 

The functional <f) is said to be continuous if it is continuous at all points of 
JT\ <f> is said to be bounded if it maps every bounded set of Jf into a 
bounded set in R 1 or C 1 , respectively, i.e., if for every N > 0 there is a 
constant C N such that | </>(/)| ^ C N for all/e jV' such that ||/|| ^ N. 

An important special class of functionals on vector spaces are the ones 
which possess linearity properties, e.g., the functionals 

</>(/) = Cf(x)dx , feW{[a,b]). 

J a 

Definition 2>2> The functional <j> defined on the real or complex 
vector space 'V' is linear if 

VJ + g)= <Kf) + m, M) = «Kf), 

for all f,g e 'V' and all scalars a from 1R 1 or C 1 , respectively. 

We can think of the vector space (1R 1 ) and (C 1 ) as normed spaces by 
taking the absolute value of a number to be its norm. By specializing 
Theorems 1.2 and 1.3 to the case of functionals, we can state the following 
theorem. 

Theorem 2*L (a) If the linear functional </>,/e.yF’, defined in the 

normed space Jf is continuous at one point, then it is continuous 
everywhere in (b) The linear functional c/> is continuous if and only 

if it is bounded. 
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2.2. The Dual of a Hilbert Space 

The family of all continuous linear functionals defined on a topological 
vector space* 'V' is called the dual (or conjugate) space of and is 
usually denoted by 7^'. The dual space is also a linear space, and plays 
an important role in the theory of topological vector spaces. By 
specializing the above statements to normed spaces we get Theorem 2.2. 

Theorem 2.2. Let Jf be a real (complex) normed space, and denote 
by JT the dual of Jf y i.e., the set of all continuous real (complex) linear 
functionals on jV'. If the sum ^ -f- of two functionals ^ and </> 2 on jV' 
is defined by 

(</>! +<f>2)(f)=Mf)+Mf), 

and the product #</>, of a functional <f> on Jf with a scalar a e (R 1 (C 1 ) is 
defined by 

(*£)(/) = W>(/), 

then Jf' is a real (complex) vector space with the above operations as 
vector operations. Moreover, Jf' becomes a normed space if || • || is 
taken to be || || = sup | <f> (/)| for || (/) || < 1. 

The proof of this theorem is trivial and it consists in noting that 
</>i -f ^2 an d a< /> are continuous linear functionals if </>, , <f> 2 are 

continuous linear functionals. 

For the special case of normed spaces which are Hilbert spaces we 
have a very close relation between the space and its dual, as reflected in 
Theorems 2.3 and 2.4. To prove the first of these theorems we need the 
following lemma. 

*Lcmma 2.1. If M is a closed linear subspace of a Hilbert space 
and / is a vector in there is a vector /' e M such that /" — / — /' is 
orthogonal to M. 

Proof. Denote by d the distance between/and M, i.e., 


d = inf 11/ 

sreM 11 J 


If d = 0, there must be a sequence^ , g 2 ,... e M such that 11/ — g n || —► 0 
when n —► go; since M is closed, we have in this case/e M and the lemma 
is proved by taking/' = / and/" = 0. 


# The concept of topology provides the most general means of introducing neigh¬ 
borhoods, and therefore the notion of convergence in an arbitrary set. See, e.g., Treves 
[1967] on the theory of topological vector spaces. 
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Assume that d > 0, and let g x , g 2 ,... e M be the sequence for which 

d = lim \\f — g n II- 

00 

It is easy to check by using the algebraic properties of the inner product 
that 

i 11/ - gm II 2 + *11/ — gn It 2 = 11/ - i (gm + gn)\\* + II ~ gn)\?- 
If we let above m, n —> oo and note that 

ll/~ \{gm +gnW > d 2 

because %(g m + g n ) e M, we get || g m — g n ||-* 0. Thus, g x , g 2 is a 
Cauchy sequence, which has a limit/' e M, since M is closed. Moreover, 
we have 

11/-/'II = lim ||/ — || = d. 

oo 

In order to show that/" _L M, it is sufficient to prove that </" | K) = 0 
for all normalized he M. By means of the identity 

<h\r-<h\f"}hy = o 

we get 

II/" II 2 = II/" - <h |/"> h || 2 + ||<A |/"> h || 2 , 

and since \\f" — <h \/">A || ^ d because (h |/">A e M, we have 

d 2 = ||/" II 2 - ||/" ~ <h |/"> h || 2 + \<h |/">| 2 > + \<Jt \nw 

i- e -, \(h |/">| = 0. Q.E.D. 

Theorem 2*3* (jR/w#’ theorem). To every continuous linear func¬ 
tional <j> on a Hilbert space corresponds a unique vector g e , such 
that 

(2.1) <£(/) = <g I/> 

for all/eJf. 

Proof. Denote by the null space of </>: 

M, ={/:*(/)=<> ,/eJf}- 

is a closed linear subspace of its linearity is evident, while the fact 
that is closed follows from the continuity of <f>: if/! ,/ 2 ,... e and if 
/ = lim> then 

W) = <Kfn) + *(/ -/») = <Kf~fn), 
and since lim n -> x <£(/ — /„) = 0. we have = 0, i.e., /e M* . 
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If == the theorem is established by taking g = 0. Therefore, 

assume that ^ Then there is at least one nonzero vector h $ M* . 
Since is closed, we can use Lemma 2.1 to state that h can be written 
as the sum of two vectors g f and g ", where g" J_ while g' e . The 
vector g" must be different from the zero vector because otherwise we 
would have h == g f e . Thus, <t>{g n ) =£ 0, and the vector 
/ “ (<£(/W(£ ,, ))£ ,/ exists. This vector belongs to M* because 



Since g" 1 , we have 



4if) 




¥) 


= o 


for any fe 34?. From the above relation we get 


4(f) = ( 


r(gi 

<g n i g*> 



we obtain (2.1) by setting 


g = 


W) 

<g n \g n '> 


g 


In order to establish that g is unique, assume that |/> 

for all /. Then we have 


<g ~gi I /> = <g I /> - <gi I /> = 0; 

by choosing / = g — gl we get <g — g 1 \g— g{> = 0, i.e., g = g t . 

Q.E.D. 

Theorem 2*4* Let be a Hilbert space and its dual. There is a 
one-to-one antilinear* transformation 


(2.2) <f> —► , </> e Jf", g^eJ^, 

of onto such that for each <f> e Jf 7 ' 

<£(/) = l/> 


for all fe Jt. 

# In the case that MP is real, ft?' is real too, and there is no distinction between linear and 
antilinear transformations of Jf' into Jf. 
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Proof. According to Riesz’ theorem, the above mapping is indeed a 
one-to-one mapping of Jf 7 ' into Jf 7 . Since for any g e the functional <f> 
defined by 

m = <g\/> 

is continuous, this mapping is onto . 

The antilinearity of this mapping is evident: 

“<l>i (/) + HfJ) = l/> + Kg* % I/> 

= <**g * 1 + b*g*\f>- Q*e.d. 

The above theorem justifies the notation introduced by Dirac, which 
has become very popular in physical literature. In this notation a vector/ 
of a Hilbert space is denoted by |/>, and called a ket vector. The 
functional <f> e Jf 7 ', which under the mapping (2.2) has / as an image is 
denoted by </1 and called a bra vector. The inner product </1 g ) of 
two vectors/, g e can be looked upon as a product of the bra </ | and 
the ket | g ). 

The above notation neither presents any real advantages, nor does it 
enrich the techniques when dealing with Hilbert spaces.* Consequently, 
it will not be adopted in this book. 

2.3. Adjoints of Linear Operators 
in Hilbert Spaces 

Riesz’ theorem enables us to prove the existence of the adjoint of any 
bounded operator acting on a Hilbert space. In order to give a sufficiently 
general definition of the adjoint of an operator, we enlarge our definition 
of operators on Hilbert spaces to operators in Hilbert spaces, by dropping 
the requirement that an operator be defined on the entire Hilbert space. 

Definition 23♦ A linear operator A in a Hilbert space is a linear 
transformation of a linear subspace 2 A of Jf 7 into 3 ?; 2 A is called the 
domain of definition of A and its image under A 

<% A ={Af:fe2 A } 

is the range of A. If another linear operator B defined on 2 B D2 A 
coincides with A on 2 A , i.e., Bf = Af for all /e 2 A , then B is called an 
extension of A (or A a restriction of B), and we write A C B. 

* There have been, however, proposals to incorporate this notation in the theory of 
rigged and extended Hilbert spaces. Then a ket is an element of and a bra is an element 
of a larger scale containing (see, e.g., the articles by Antoine [1969], PrugoveSki 
[1973b]). 
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Theorem 2*5* Let A be an operator on Jf 7 , with domain 2 A . 
Denote by 2 A * the set of all vectors g e Jf 7 which are such that for each 
g there is one and only one vector g* which satisfies the equation 

( 2 . 3 ) = 

for all/e 2 A . The mapping 

g->g*=A*g, g e 2 A * , 

is a linear operator, called the adjoint of A , and it exists, i.e., 2 A * ^ 0 
if and only if 2 A is dense in Jf 7 . 

Proof. In the case that 2 A is dense in Jf 7 and g, g* are two vectors 
satisfying (2.3), g* is uniquely determined by g; namely, if g x * also 
satisfies 

<g 1 4 f> = <£i* I/> 


for all/e 2 A , then 

<£* — gi* I/> = <£* I/> - <£i* I/> = 0 

for all/e 2 A , and since 2 A is dense in Jf 7 , we must have^* — gf* = 0. 
Thus, if (2.3) has any solutions, we can define an operator A*, with a 
domain 2 A *, consisting of all ^ for which there is a^* such that^ and^* 
satisfy (2.3), and mapping ^ into^*. 

The operator A* is defined at least for g — 0, since then we can 
take g* = 0. Moreover, if g x , g 2 e 2 A *, then 

<A*(agi + bg 2 ) | /> = <ag! + bg 2 | Af) 

= **<#! 1 4f> + b\g 2 1 4 /> 

= i/> + t>*<d*g 2 1/> 

= |/>, 

which proves that ag 1 + i^ 2 e and A* is linear. 

Assume now that 2 A is not dense in Jf. Then its closure 2 A is 
different from Jf 7 , and there is in Jf 7 a vector h $ 2 A . According to 
Lemma 2.1 we can write h = h! + A", h! J_ where h! ^ 0. We have 
then 

<*' l/> = 0 = <0 | 4/> 

for all f&2 A . This shows that when 2 A ^ Jf 7 , we can take in (2.3) 
g* = h\ Hence, we have at least two distinct solutions, 0 and h\ belonging 
to 0. Thus, A* does not exist, because (2.3) can never have a unique 
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for any g, since if for some g there is a g* satisfying (2.3), then£* + h ' will 
also satisfy (2.3). Q.E.D. 

An operator A might not have an adjoint A *, but there might still be 
many operators B satisfying the relations 

<Bg l/> = <g I Afy 

for all / e 2 A and g e 2 B . Two such operators A and B are said to be 
adjoint to one another. We note that if the adjoint A* of A exists, and A 
and B are adjoint to one another, then necessarily B C A*, i.e., A* is a 
maximal extension of all the operators adjoint to A. 

2.4. Bounded Linear Operators 
in Hilbert Spaces 

If A is bounded linear operator which has an adjoint A *, then from 
Theorems 2.6 and 2.7 it is easy to see that A* is also bounded, and that 
3f A * = 

Theorem 2*6 (The extension principle). A bounded linear trans¬ 

formation T from the Hilbert space into the Hilbert space M > 2 > defined 
on a dense domain 2 T (2 T — c^f/), has a unique continuous linear 
extension to the entire Hilbert space Jtf’ 1 . 

Proof. Let / be any given vector in . Since 2 T = , there is a 

sequence f x ,/ 2 ,... e 2 T converging to/. The sequence Tf x , Tf 2 ,... is a 
Cauchy sequence, since according to Definition 1.4 there is a constant C 
such that 

II Tfm — Tfn ll 2 ^ C ||/ m —f n ||x . 

Hence, the limit of Tf x , Tf 2 ,... exists. Moreover, this limit is independent 
of the chosen sequence converging to / because if f[ ,/ 2 ,... e 2 T also 
converges to /, then lim n _ >00 1| Tf n — Tf n ' || 2 = 0, since 

II ffn Tf n f H 2 ^ C ||/ w f n ' | |x • 

Thus, we can define a transformation T f for all / e by choosing for 
each vector / a sequence f x , f 2 ,... e 2 T converging to / and setting Tf to 
be the limit in the Hilbert space norm of Tf x , Tf 2 ,..., 
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This transformation T', with 3f T » — , is evidently linear: 

T\af + bg) = s-lim T(af n + bg n ) 

n->oo 

= s-lim(aT/„ + bTg n ) = aT'f + bT’g, 

n->oo 


where g x , g 2 ,... e 2 T is any sequence converging to g , and “s-lim” 
denotes the limit in the norm of • 

The fact that T is bounded follows from the inequality 

II II ^/-^/nll 2 +||T/J| 2 

<11 T'f-Tf n \\ 2 + C\\f n \\ l9 

since when we let n -> oo, || T’f — Tf n || -> 0, while || f n ||i -> H/Hj . 

It is evident that T' is the only continuous extension of T to the entire 
space . Q.E.D. 

Theorem 2*7* If A is a bounded linear operator defined on the 
entire Hilbert space Jtf*, its adjoint A* is also a bounded linear operator 
defined everywhere in and ||^4|| = || ^4* ||. 

Proof. For a fixed g e the functional^ defined by </>(f) = < g | Afy , 
/ e Jf 7 , is obviously linear. Moreover, <j> is also continuous, as can be seen 
by using the continuity of A (see Theorem 1.3) 

lim = <g | s-lim Afy — <g \ Af o y = <f>(f 0 ), 

ll/-/ 0 lho ru} ll/-/ 0 lho J / X6 J0/ 

Consequently, we can apply to <f>(f) Riesz’ theorem (Theorem 2.3) to 
deduce the existence of a unique £*, satisfying 

<**i/>=*a> = <*i4/>. 

By Theorem 2.5, the mapping 

g*~+g* = A*g, geJf, 

defines the adjoint of A as a linear operator, for which we see that 
From the inequality 

I <A*g l/>l = Kg I 401 <11 A II ll/ll II* || 


we get by setting f = A*g 
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The above shows that A* is bounded and || A* || ^ || A ||. By reversing 
the roles of A and A* we arrive at || A || ^ || A* ||, thus proving that 

Mil = M*ll- QE.D. 

2.5. Dirac Notation for Linear Operators 

It is interesting to take a look at the above theorem from the point of 
view of Dirac’s terminology. To realize the nontrivial implications of 
this terminology, consider the more general case of a bounded linear 
operator A defined everywhere on a normed space JT* If <f> is an element 
of the dual then the functional (f) 1 defined by 

feJr, 

is again a continuous linear functional, i.e., <fA e as can be easily 
proved by using the linearity and continuity of <f> and A (A is bounded 
and therefore continuous, by Theorem 1.3), 

W + bg) = WaAf + bAg) = a<j>(Af) + bftAg) = + b^(g), 

lim W„) = Hm <K4fn) = ^(s-limTO) = ^(i4(s-lim/ w )) = ^(s-lim/ w ). 

7|—>-j-00 71—71—71— »-j-00 ft ->-)-00 


Consequently, the mapping 

= a v, 

is a transformation of JT into itself. Furthermore, it is easily seen that A 1 
is a linear operator on JT f : 

i +bmn = Ki+ h 2 )(a/) 

= aHAf) + b UAf) = aA'Uf) + bA'Uf). 

Thus, every bounded linear operator A on Jf induces a linear operator 
A 1 on JT . In case of a Hilbert space to every element <f> g 
corresponds, according to Theorem 2.4, a unique vector g# g We 
have seen that in Dirac notation one denotes </> by the bra symbol 
<g* i. s ° that <f>(f) — < g* I/)- Moreover, in this notation one writes 

A'<f> = (g, | A, 

agreeing to denote the above introduced operator A 1 on Jf' by the same 
letter A, but introducing the clause that when A stands, de facto, for the 
operator A 1 , then it acts on the bra vector from the right . 

In this notation one writes 


•K4f) = VWJ) = <g* I a | />. 
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According to Theorem 2.7, we have 

<K4f) = < A *g* I/>, 

which implies that, using Dirac notation, 

<A*g* I = I ^ 

2.6. Closed Operators and the Graph of an Operator 

In the case that a linear operator A is unbounded and has an adjoint 
A*, we can deduce from the preceding theorem that A* is also 
unbounded when 3 A * — Jtf 7 [so that (^4*)* = ^4** exists]. In fact, if 
A* were bounded, then A ** would also be bounded, and since obviously 
A C A**, this would imply that A is bounded. However, A * has always 
the property of being closed, which is a somewhat weaker property then 
boundedness. 

Definition 2*4* The linear operator A in the Hilbert space Jf* is 
closed if it has the property that whenever f x ,/ 2 ,... e Of A converges to a 
vector/e and Af x , Af 2 ,... also converges to some vector then 

/ e 3 A and h = Af 

Theorem 2*8* The adjoint A * of a linear operator A in Jf 7 is closed. 

Proof . Let £i, # 2 >••• G ^ 4 * converge to g e and assume that 
A*g± , <4*# 2 >••• converges to a vector h . Then for any /e 

<* I /> = lim<^4 *# w |/> = lim<# n | i4/> = <# | -4/), 

n->oo «-»oo 

which by the definition of ^4* implies that# e 3 A * and that h — A*g. 

Q.E.D. 

The concept of a closed operator can be readily grasped in terms of the 
graph of an operator. The graph of an operator is a generalization of the 
concept of the graph of a real function f(x), x e IR 1 , which, we recall, 
consists of all the points ( x,f(x )) in ( U 2 ) — (IR 1 )® (IR 1 ) corresponding to 
x from the domain of definition of f(x ). 

Definition 2*5* The graph G A of an operator A acting in the 
Hilbert space is the subset 

G A ={(f,Af):fe3 A } 

of the direct sum Jf @ Jf (see Chapter II, Theorems 6.1 and 6.3). 

We note that A is closed if and only if whenever 


(A, 4/i), (/a » 4 / 2 )*-" e G a 
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is a convergent sequence in then its limit (/, g) lies in , 

i.e., is a closed subset of if and only if A is a closed operator. 

In Chapters I and II we encountered many operators A which were 
Hermitian , i.e., for which < Af \ £> = </1 Agj for all fgE 3 A . These 
operators were in addition linear. In mathematical literature such an 
operator is called symmetric. 

Definition 2*6* The linear operator A acting in a Hilbert space is 
symmetric if < Af \ g} = </1 Ag > for all /, g e Of A . A symmetric operator 
A is called self-adjoint if A = A*, i.e., A* C A and A C A*. 

We note that a self-adjoint operator is always closed (because by 
Theorem 2.8 A* is closed) and densely defined (since A* exists). 

All the Hermitian operators which have been introduced while 
studying different physical problems were symmetric. For instance, we 
have seen that the Schroedinger operator H s defined in Chapter II, §5, 
is linear, Hermitian, and defined—under reasonable requirements on the 
potential (see Chapter II, Theorem 5.5)—on the dense subspace ^ b ([R 3n ) 
(see Chapter II, Theorem 5.6). 

It is an unfortunate feature of unbounded linear operators defined on 
linear subspaces of a Hilbert space that they do not have closed exten¬ 
sions to the entire Hilbert space (see Section 3.5), and for this reason 
we had to give particular attention in Definition 2.3 to the domain of 
definition of an operator. This fact makes the study of unbounded 
operators particularly difficult, especially since it prevents simple 
definitions of quite fundamental algebraic operations with such operators. 
For instance, if A and B are two operators for which 3 A and 

3 B ^ then we can define A -f B only on 5$ A n 3 B , 

(A + B)f = Af+Bf f e 3 A n Q B . 

Since it might happen that 3 A n 3 B — {0} even when both 3 A and 3 B 
are everywhere dense in it follows that A + B might not be defined 
at all except on the zero vector 0. 

Similarly, in order to define AB , we must assume that Bf e 3 A 
whenever we apply AB to some / e 3 B , 

(AB)f=A(Bf), f e3 B , Bf e 3 A . 

Only in case of multiplication of an operator A by a scalar a can it be 
stated simply 

(aA)f = a(Af), fe3 A . 
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Despite these difficulties we shall have to study unbounded operators 
in some detail since practically all of the operators which appear in 
physics are unbounded, such as the position and momentum operators 
or the Schroedinger operator (see Exercise 2.4). 

2.7. Nonexistence of Unbounded Everywhere-Defined 
Self-Adjoint Operators 

We shall proceed to prove that no symmetric unbounded operator has 
a self-adjoint extension to the entire Hilbert space. For this proof we 
need some preliminary results. 

*Lemma 2*2* If </>i, </> 2 >••• is a sequence of continuous linear 
functionals on ^f 3 , which is bounded for each fixed/e ^f 3 , i.e., 

\Uf)\ <c f , 

then , (bo ,... are uniformly bounded within the unit sphere 
Si = {h: II h II < 1}, i.e., 

I Mi )I < c > g eS i . « = 1, 2,..., 

or equivalently 

\Uf)\ < C\\f\\, n = 1,2,..., 


for all /e Jf 3 . 

Proof. We shall show first that </>i(/), <j > 2 (/)>••• is uniformly bounded 
inside at least one sphere {h: || h — h 0 || < r} in Jf. 

Assume that the above statement were not true! Then we would have 
I ^(/i)! > 1 for at least some integer and some f x e . By the 
continuity of </> ni (/), we also have 

l<M/)l >i, / e s (1) > 

on a sphere S {1) C with the centre at f x . Since <t> 2 (/),••• are not 

uniformly bounded in any sphere, including S (1) , there must be an 
h e S' 1 * such that | <L (/ 2 )| > 2 for some integer n 2 . Due to the continuity 

of K(f), 

I *«,(/)! >2, /eS BI , 

for some sphere S {2) with its center at f 2 . By continuing this reasoning, 
we establish the existence of a sequence S {1) D S {2) D ••• of spheres in 
which 


\K(f)\ > k > 


/eS (W , 
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for some n k , k = 1,2,.... Since is complete, Ofc=i S (k) contains at 
least one element/ 0 , for which we have according to the above construc¬ 
tion 

\K(fo)\ > k > k = 1, 2 ,... . 

Thus, fa(f 0 ), fa(fo),--- is not a bounded sequence, contrary to the assump¬ 
tion of the theorem. This means that if fa , fa ,... are bounded for each 
/e^f 3 , then fa , fa ,... are uniformly bounded on at least one sphere 
S 0 = {h: || h - h 0 || < r} in 

I fa(f)\ ^ C 0 , /e S 0 , w = 1, 2,... . 

Consequently, we get by exploiting the linearity of fa(f) 

fa(g) = (llr)(fa(rg-h 0 )+fah 0 )). 

Since for g e S 1 wt have \\g\\ <1 1 and therefore rg — h 0 e S 0 so that 

I fa(rg ~ K)\ < C 0 , 

we have established that fa , fa ,... are uniformly bounded on : 

2C 

\fa(g)\<^ Q.E.D. 

^Theorem 2*9* The adjoint A* of a linear operator with domain 
of definition 2 A = is a bounded operator on ^f 3 . 

Proof. Since = ^f 3 , ^4* exists by Theorem 2.5. Assume that the 
theorem is not true for A, i.e., that A* is not bounded. Then there must 
be a sequence fa , fa ,... e 2 A * such that || fa ||, || fa ||,... is bounded while 
II A*h n || —► +oo forn —► + oo. Since || fa ||, || fa ||,... is bounded, we have 

(2.4) lim || A*g n || = +oo, ||* tt || = 1, 

n-> oo 

for the normalized vectors g n = || h n || _1 h n . 

The functionals fa defined by 

fa{f) = <gn\Af\ feJT, 

are obviously linear and defined everywhere on because 2 A — . 

Moreover, these functionals are bounded, 


I fa(f)\ = KA*g n |/>| <M*?»||||/||, 
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and therefore continuous (see Theorem 2.1). Since in addition the 
sequence <f> x ,(f > 2 ,... is bounded for every fixed/e 

I Mf)\ = \<gn I Af>\ < II g n II II Af\\ = II 4/11, 

we can apply Lemma 2.2. According to this lemma the functionals 
</>! , </> 2 ,... have a common bound for all/e 

\Uf)\ H/ll, »= 1,2,.... 

But then we get by setting / = A *g n 

\MA*gn)\ = \\A*g n \\*^c\\A* gn \\, 

which implies that 

M^nll^C, » = 1,2,..., 

contradicting (2.4). Q.E.D. 

From Theorem 2.9 we can immediately deduce that the following 
theorem is true. 

Theorem 2.10 ( Hellinger and Toeplitz ). Any symmetric operator 

A defined on the entire Hilbert space ^ is bounded. 

The above theorem is evidently true, since AC A* and the fact that 
a — & imply that A = A*, while on the other hand A* is bounded by 
Theorem 2.9. 

We note that in the Hellinger-Toeplitz theorem we have that A = A*, 
and therefore A is closed by Theorem 2.8. In §3 we shall generalize this 
theorem to any closed operator defined on a dense domain. 

Exercises 

2.1 ♦ Show that the functional 

m-m, 

is linear but it is not continuous in the norm 

r i»+ co tI/2 

11/11 = U 

2.2. Show that the functional 



196 


III* Theory of Linear Operators in Hilbert Spaces 


is linear and continuous inL 2 ^ 1 ), and find a g(x ) eL( 2 )(IR 1 ) such that 

4>(f) = f g*(x)f(x) dx = <£ |/>. 

J 0 

2*3* Let A — || a ik || be the operator acting on || c k || g l 2 (n) in the 
following manner: 

n 

A || c t || = || b t ||, bi = £ a iu e h . 

k =1 

Show that the above operator is linear and bounded, and that its adjoint 
is represented by the Hermitian conjugate matrix || ||, 


A* || c i || = || b( ||, 


b i = I <**c h . 

k=1 


2 A* Prove that the operator 




is an unbounded linear operator on L 2 (U 2 ). 

2,5. Let (/1 g), f, g g ^f 3 , be a bilinear form on the Hilbert space Jf 7 
which is bounded, i.e., 

\(f\g)\<C\\f\\\\g\\. 

Prove that there is a bounded linear operator A on ^f 3 , with || A || ^ C and 
3 A ~ , such that 

(f\g) = <Af\g>- 

Show that (/1 g) is Hermitian if and only if A* = A. 

2*6* Prove that if A* exists and A is any number, then (A^l)* and 
(A I Aj exist (in .A. | A, A stands for the operator which multiplies 
each vector by the number A) and 

(A^4)* = A *A*, (A + A)* = A* + A*. 

2*7* Show that if A and B are bounded operators with 
3 A = 3 B = Jf, then (A + B)* = A* + B* and (AB)* = B*A*. 

2*8* Prove that if A* and B* exist, then (A + B)* D A* + B* if 
3 A+B = and (AB)* D B*A* if 3 AB = JT. 

2S* Show that if A* exists and A C B, then B* also exists and 
B*CA*. 
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2AO* Quote the theorems in this and the preceding sections which 
show that every Hermitian linear operator on a finite-dimensional 
Hilbert space is self-adjoint. 


3♦ Orthogonal Projection Operators 

3.1. Projectors onto Closed Subspaces 
of a Hilbert Space 

The simplest type of operator, which is of essential significance in the 
spectral theory of self-adjoint operators as well as in quantum mechanical 
problems, is the orthogonal projection operator. As the name suggests, 
this concept of a projection operator acting on a Hilbert space 34? is a 
generalization of the notion of an orthogonal projection of a vector 
r in (1R 3 ) onto a line or a plane in (IR 3 ). For instance, if we have a line 
through the origin of [R 3 [i.e., a one-dimensional subspace of (IR 3 )], 

M = {te: — oo < t < +oo} 

determined by the unit vector c, || c || = 1, recall that the orthogonal 
projection of a vector r onto this line is the vector 

r M — ( e * r) c. 

Theorem 3*1 ♦ Let M be a closed linear subspace of the Hilbert 
space 34?. Denote by M x (or © M) the linear space (see Exercise 3.1) 
of all vectors orthogonal to M: 

M- 1 = (h: (h\g') — 0, g e M}. 

Then each vector/e 34* can be written uniquely as a sum 

(3.1) /=/'+/", /'eM, f"eM\ 

and the mapping 

(3.2) /->/' = feJT, /'eM, 

is a linear operator defined on the entire called the projector (or 
orthogonal projection operator) onto M. 

Proof. The possibility of the decomposition (3.1) for each /e^ is 
guaranteed by Lemma 2.1. In order to prove the uniqueness of this 
decomposition, assume that 


f=fi+fi, //'eM- 1 - 
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Then we have 

/' -A' = -if" - A")- 

Since, on the other hand, /' — // e M, /" — //' e M- 1 -, and therefore 

/' - A' 1/" - A". w e get 

</' -A' I/' — A'> = -</' -A' \f" — A"> = 0, 

which implies/' —// = 0 and/" —//' = 0. Thus, the uniqueness of 
/' and/" for each/e ^f 7 is established, and consequently (3.2) defines a 
mapping of into M. 

To prove that this mapping is a linear operator, note that if 
*=*'+**, g"eM\ 

we have 


af+bg = (af' + 6/) + (af" + ^"), 
where, due to the linearity of M and M 1 , 

af' bg' e M and af " + e M- 1 . 

Since the decomposition (3.1) of any vector in is unique, we get 

( a f + bg) f = En(af + bg) = af' + bg' = a£ M (/) + 6£ M (£), 
which proves the linearity of E M . Q.E.D. 

If M is a closed linear subspace of a Hilbert space Jif and E M is the 
projector on M, then /' = 2? M / is called the projection of the vector 
/e Jf 7 onto the subspace M. The following theorem provides a simple 
recipe for computing projections in practically important cases. 

Theorem 3*2* If M is a separable closed linear subspace of a (not 
necessarily separable) Hilbert space ^f 7 , and if {e 1 , e 2 ,.••} is an ortho¬ 
normal basis in M, then 

(3.3) $ 1 /= £<«*!/>«*, /ejr, 

k 

where 2? M is the projector on M. 

Proof ‘ In order to prove that 2? M / is given by (3.3) when {e 1 , e 2 ,...} 
is a (countable) orthonormal basis in M, note that 

g’ = Z<«S l/> e k 

k 
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exists and / gM (Chapter I, Theorem 4.6) for all /e . Furthermore, 
g" — / — /gM 1 because for any h e M we get 

<g”\h} = </1 A>-£</| «*><«* |A> 

k 

= </' I h> — Z </' I «*><«* | A> = 0, 

& 

where in deriving the third expression we used the fact that 

</" I h > = </" | *> = </" | * 2 > = - = 0, 

and then ParsevaPs relation (4.15) of Chapter I was applied to the 
orthonormal basis {e 1 , e 2 ,...} in M. Thus, we have 

f = g'+g\ g'e M, g"eM\ 

and therefore, due to the uniqueness of the decomposition (3.1), 
g'=f' = E M f. Q.E.D. 

An illustration of the above theorem is provided by the special case 
of a one-dimensional space M. If e is a normalized vector from M, then we 
have 

*m/=<«!/>*. 

In Dirac’s notation the projector 2? M is denoted by | e)> (e |, thus providing 
a handy mnemonic device for writing* the above formula in the ket 
formalism: 

(3.4) E M f = (| «><« |) |/> = | *><* |/> - <* |/> 

It is immediately noticeable that in defining the projection on a closed 
subspace M, the spaces M and M- 1 play a symmetric role. In fact, we see 
from (3.1) that 

(3-5) E„ + E mx = 1. 

M 1 is called the orthogonal complement of M in We say that is the 
orthogonal sum of the subspaces M and M 1 , and express this statement 
symbolically by writing 

= M © 

More generally, we define the orthogonal sum of subspaces as follows. 


* In mathematical literature, instead of writing | e><e |, one would write <e | •> e. 
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Definition 3*1* Let be a family of mutually orthogonal closed 
linear subspaces of a Hilbert space i.e., if M 1 , M 2 e then Mj 1 M 2 . 
The closed linear subspace 

© M = [I fife U Mil 

spanned by (J Me ^ M is called the orthogonal sum of the subspaces M e JF. 

In the case where ^ can be written as a finite or infinite sequence 
{IV of closed subspaces of their orthogonal sum is also 

denoted by 

k 

It is significant to note the similarity between the notation for the 
direct sum of Hilbert spaces and the orthogonal sum of subspaces of a 
certain Hilbert space. It will soon be evident that the direct sum of the 
closed linear subspaces M x , M 2 (which are themselves Hilbert spaces) 
is unitarily equivalent to the orthogonal sum of M 1 , M 2 . 

3.2. Algebraic Properties of Projectors 

Theorem 3*3* (a) A linear operator E defined on the entire Hilbert 

space Jff is a projector if and only if it is self-adjoint, 

(3.6) E = E *, 
and idempotent , 

(3.7) E = E *. 

(b) The closed linear subspace M on which a projector E defined on 
projects is identical with the set of all vectors / e satisfying the 
equation 

(3.8) Ef = f . 

(c) || E || — 1 for any projector E which is different from the zero 
operator 0, i.e., which projects on a subspace M ^ {0}. 

Proof . Let E M be the projector on the closed linear subspace M of Jf. 
For arbitrary/, g e Jf? we get by using the familiar decomposition 

/ =/' +r, g = g f +g\ f\g r e m, r, g " e m-\ 

that 

</1 e mS > = </' +/" i g 'y = </' i g f y = </' i g f + g "y 

— <E„f I £>> 
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which proves that 2? M * = E M . Furthermore, we see from the above that 
for any / e 

<E M f \ g> - </' I g'> = <E„f | E M gy = (Etff I g y 

for all g g Jf 7 , which implies that E M f = E M 2 f, i.e., E M = E M 2 . Note that 
in particular 2? M / = /when/e M! 

Assume now that a linear operator E , with 2) E = is such that 
E — E* and E = 2? 2 . Denote by i? the set of all vectors f g for which 

Ef =/• 

is a linear subspace of Jf 7 , since if /, £ e i?, then 

E(af + A#) = aEf + = a f + bg 

for all scalars a , 6. Moreover, i? is closed; namely, if the sequence 
fi 9/2 >••• g R converges to some vector f g Jf 7 , we get 

hm \\ Ef f k \\ = lim \\ Ef Ef k \\ =0 

00 «-*oo 


by using the fact that E is bounded by Theorem 2.10, and therefore 
continuous (Theorem 1.3). Since f x ,/ 2 ,... can have only /as their limit, 
we conclude that Ef = /, i.e., fe R. 

In order to establish that E is the projector on R y decompose each 
vector g g Jf 7 in the following way: 

g = Eg + (g- Eg). 

But Eg g R , as we can see by using the relation E 2 = E , 

E(Eg) = E 2 g = Eg. 

On the other hand, g — Eg g R l , since for any f g R 
<g ~ Eg |/> = <g — Eg \ Efy = (E(g - Eg) |/> = ^ |/> = 0, 

which shows that E = E R . 

The above considerations establish the points (a) and (b) of the 
theorem. Since by (3.1) 

ll/ll 2 > II/' II 2 = II ^m/II 2 , 

we have || 2? M || ^ 1. If E ^ 0, there must be at least one vector ^ 0 
which belongs to M and therefore satisfies (3.8). Thus, || E^f x || = \\f x ||, 
which shows that || E^ || = 1. Q.E.D. 
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Theorem 3.3 provides a very useful criterion for verifying whether a 
linear operator is a projector, without having to find the closed subspace 
in which it projects (see Exercise 3.2). It will be used so frequently in the 
following that no explicit mention will be made of the fact that the 
theorem has been applied while carrying out a certain step. 

Theorem 3*4* Two closed subspaces M and N of are orthogonal 
if and only if E M E N = E N E M = 0. 

Proof. Since E M = E M *, we have 
(3.9) <E M /|E N £> =</|E M E N £> 

for all /, g e Jf. If M N, as we have E M fe M and E N g e N for all 
/, g e Jf 7 , we get from (3.9) that </1 E M E N £> = 0. Conversely, if 
E M E N = 0 and /,g are any vectors from M and N, respectively, due to 
£„/ = / and E N g — g we obtain from (3.9) / _L g- 
Since E M E N = 0 implies that (see Exercise 2.7) 

= 0 = (E M E N )* = 9 

we see that the statement E M E N = 0 is equivalent to E N E M = 0. Q.E.D. 

3.3. Partial Ordering of Projectors 

If M and N are two subspaces and M C N, we shall write E M ^ E N . 
The following theorem gives us two additional criteria for this relation to 
hold. 

Theorem 3*5* The following three statements about the projectors 
E M and E N defined on Jff are equivalent: 

(a) M C N, i.e., E M ^ E N ; 

(b) E„E n = E N E M = E M ; 

(c) ||E M /||<||E N /||forall/e^. 

Proof. If M C N, then E M /e N for any/e Jf 7 , and consequently, by 
(3.8), 

^n(Em/) == E M / 


for all fe Hence, E M — E N E M , which implies 

|| E M /1 | = || E M E N /|| < || E m || || E N /|| < || E N /|| 
since || E M || < 1 according to Theorem 3.3(c). Thus, (b) implies (c). 
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Finally, to prove that (c) implies (a), note for any closed subspace M x 
we can infer from 

n/ii a = ii^M 1 /ii a + ii^M 1 vir 

that/e Mj (i.e., / = E^J) if and only if 

ll/ll = 11VII* 

If (c) is true, then whenever/e M, we have 

ll/ll = ||£ m /II <I|£ n /I|. 

Combining the above with the fact that || 2? N /|| < ||/|| for any f e , 
we deduce that 


II/II =11 £n/I|. 

This means, according to the aforesaid, that/e N, thus proving that 
M C N. Q.E.D. 

The reason for the particular notation ^ used in designating the 
relation between any two projectors satisfying the conditions of the 
preceding theorem is that this relation obviously introduces a partial 
ordering in the set of all projectors acting on Jf. 

Definition 3.2. A set S is said to be partially ordered if a relation 
£ ^ rj is defined for some pairs £, rj of elements of S, which satisfies the 
following conditions: 

( 1 ) f < f for all f e S; 

( 2 ) if £ ^ rj and 77 < £, then £ — 77 ; 

( 3 ) if £ < 77 and 77 < £, then £ < £. 

Other examples of partial ordering of a set are provided by the “equal 
or smaller” relation ^ holding between real numbers, and by the relation 
C of set inclusion holding between the sets in a family of sets. 

3.4. Projectors onto Intersections and Orthogonal Sums 
of Subspaces 

Theorem 3.6. The product 2? M 2? N of two projectors 2? M and E N 
in is a projector if and only if E M and E N commute, 


(3.10) 
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If this is the case, the set L = M n N is a closed linear subspace of and 

Ei == • 

Proof. If the linear operator i? M 2? N is a projector, by Theorem 3.3, 
it must be Hermitian, and consequently 

E„E n = (E„E n )* = £ n *£m* = . 

Conversely, if E M and E N commute, then i? M 2? N is Hermitian, 

(£ m £n)* = E H *En* = E n E„ = E M E N , 


and idempotent, 


(EmE n ) 2 = E„(E n E„) E h = E M 2 E N 2 = E"E H , 


i.e., Em E n is a projector. 

To prove the second part of the theorem, assume that (3.10) is true, 
and denote by L the closed subspace of onto which 2 ? m 2?n projects, so 
that E l = E m E n . Then we have 

\\EJ\\ =\\E N E„f\\ <||£ M /II, 

\\EJ\\ =\\E„E N f\\ <||^ N /||, 

for all i.e., according to Theorem 3.5 E L < 2? M and E L < E N , 

and consequently LC Mn N. Conversely, if fe M n N, then 
E L f = Em(E n /) — E M f = / and therefore /e L, i.e., LDMnN; thus, 
L = M n N. Q.E.D. 

Theorem 3 The sum 


E Mi + Em 2 + •** + E^ n 


of a finite number of projectors defined on is a projector if and only if 

(3.11) E Mi E Mj = 0 for i ^ j; i,j = 1,..., n , 

i.e., if and only if , M 2 ,..., M n are mutually orthogonal; in that case 

(3.12) Em = Em x + Em 2 + ••• + £ Mtc , 


where 

(3.13) 


M = M 1 ©M 2 ®-®M n . 
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Proof . The linear operator 

A — £ Ml + ••• + Etf n 

is Hermitian, A = A*, since it is a sum of bounded Hermitian operators, 

A * = £$! + *“ + ^m b — ^Mj. + *“ + = A. 

Thus, i is a projector if and only if A = A 2 . This will be the case 
whenever (3.11) is satisfied, since then 

n n n 

A 2 = Yj Vm. == Z = X ^M < = A. 

i,j =1 i= 1 i=l 

Conversely, assume that ^4 is a projector. Then for i ^ j 
(3.14) 

</1 £m/> + </1 E Mj fy < £ </1 W> 

fc=l 

= <j\m <</i/>, 

where in deriving the first inequality we have used the fact that for any 
projector 

</1 £/> = </1 £ 2 /> = <Ef I £/> >0, /G JT, 

and in deriving the second inequality we employed the property that 
|| A || < 1 if A is a projector. 

From (3.14) we obtain 

\\E„ i f\f + \\E Mj f\\*^\\f\\*. 

If we set above / = E Mj f, we get 

II En£n.f\\* < 0, 

which implies E M E Mj f = 0 for all f e , i.e., that M L ,..., M TC are 
mutually orthogonal (see Theorem 3.4). 

To prove (3.12), assume that (3.11) is satisfied, so that A is a projector. 
Denote by M the space on which A projects, i.e., A = E n . If/e M, then 

f = Af — EnJ + - + E M Je M x © - © M re , 


which shows that 
(3.15) 


M C M, © ••• © M„ . 
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Suppose that# e Mj @ ••• © M n . By Definition 3.1,# is in the closure 
of the linear manifold spanned by M 1 U ••• U M n . This means that there 
is a sequence g k , g 2 e (ULi M fc) 

a = orU) ... _i_ p( n ) 1) c M p(w) c M 

Ok 0 ]c \ nr Sjc 9 Sjc c 1 'i c 1 'n ’ 

converging in the norm to#. From the mutual orthogonality of M x M n 
we get 

n n 

ll s k —g t II 2 = Z < g ^ I - ft W) > = Z ll 4’’ - g™ ll 2 

i,j =1 3=1 

by using that fact that #J. l) — #^ e for any k, l — 1, 2,... . Thus, for 
any fixed i = 1,..., n, 

lltf* 0 -^/Ml < ll^-^JI-^0 for k,l-+co, 

so that #i t} , # 2 t) ,... is a Cauchy sequence in . Since M t . is closed, this 
sequence has a limit g {%) . Thus, we can write our arbitrarily chosen 
element# from M 1 © ••• © M n in the form 

# =# (1) + +# (w) , # (1) eM 1 ,...,#< w >eM n . 

Moreover, since ±_ M^- for i j, we have 

E^.g {j) = &ag {i) - 


It follows that 

n n 

(3-16) Ag = £ E nt gM = £ g“> = g, 

i,3=1 3=1 

for the arbitrarily chosen #e Mj © ••• © M n . From the above relation 

(3.16) we deduce that# e M, thus proving that 

(3.17) MDM 1 ©-@M tt . 

The relations (3.15) and (3.17) combined together imply (3.13). Q.E.D. 

Theorem 3*8* The difference E M — E N of two projectors is a 
projector if and only if N C M; in that case 

Z?m — E n = Em©N j 


where M © N is the orthogonal complement of N with respect to M: 

(3.18) M © N ={/:/eM,/J_N}. 
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Proof . Suppose that E M — E N is a projector, and let L be the space on 
which it projects: 

Em — Eu — E l . 

Thus, E m is a projector which satisfies the relation 


Em — Ei E N , 

In this case we have according to Theorem 3.7 that E L E N = 0 and 

M - L ® N. 

Conversely, if N C M, then by Theorem 3.5 
Em = • 

This implies that the evidently self-adjoint operator 2? M — E N is idem- 
potent, 

(Em — E N y — Em — ~ — Em ~ > 

i.e., E m — E n is a projector. Q.E.D. 

We shall now generalize Theorem 3.7 to the case of a direct sum of a 
countable infinity of closed subspaces. 

Theorem 3*9* If M 1 , M 2 ,... is a sequence of mutually orthogonal 
linear subspaces of a Hilbert space and M is their orthogonal sum 


M = M.QM,®-, 

then 

(3-19) lim||(£ M -i £ m ,)/|=0 

for every / e Jf. 

Proof. By applying Theorem 3.7 to 2 ? Mi + ••• + ^m tc we easily 
derive 


n n / 71 \ 

Z IIWII 2 = I = (I VI/ 

= <^M ie ...©M„/l/> < ll/ll 2 , 

which shows that the series Y^ _, || £ M _/1| 2 converges. Consequently, 
the sequence 

liv » = 1.2.... 

U-i 


(3.20) 
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is a Cauchy sequence, as can be seen from the relation 

II m+r m2 m+r 

Z W = £ II^mJII 2 . 

k=m k=m 

If we denote the limit of the sequence (3.20) by/', the operator Af = /' 
is obviously linear and defined everywhere in Moreover, we have 

<^/U> = lim(f; E„J\g) 

x fc=l ' 




-!™ (i e «j | i E *S) - w i *>, 

which shows that A — A* and A 2 = A. Thus, A is a projector. 

Denote by M the linear subspace of on which A projects. If/e M, 
then 

f — Af = E^J + E^J + ••• g M x ® M 2 (T) ? 


i.e., we must have M C M x © M 2 © ••• . 

If we assume that M ^ M x © M 2 © •••, there would be a vector which 
belongs to M x © M 2 © ••• but not to M. Since M 1 © M 2 © ••• is a closed 
linear subspace of a Hilbert space and as such is a Hilbert space itself, we 
could deduce from Lemma 2.1 the existence of a nonzero vector 
# g Mj © M 2 © ••• which is orthogonal to M. This implies that# _L , 
k = 1, 2,..., because we obviously have || 2? M /|| <M/II for all/e^, 
and therefore M x ® M 2 © ••• C M. 

On the other hand, by Definition 3.1 of orthogonal sums, g is the limit 
in the norm of linear combinations of vectors from M 1 , M 2 i.e., we 
have vectors g ± k g M x ,..., g * g , k = 1, 2,..., such that 

lim ||# - (#!* + ••• + #*) || = 0. 


Consequently, since# _LM 1 ,M 2 ,..., we have 

<g I g> = lim <# | #! fc + •••+#£> = 0, 

«-» OO 


which shows that g = 0, thus contradicting the assumption that 
M # Mj © M 2 © - . Q.E.D. 
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Theorem 3*10* If E Mi , E M2 is a sequence of projectors defined on 
Jj? which is monotonically increasing 

^ E„ 2 ^ , 

or monotonically decreasing 

E m x ^ ^ ••• , 

then we have for any fe 

lim ||(£ M ~ £ m tc )/II = 0, 

n-» oo n 

where E M is the projector onto (see Chapter II, Definition 1.2) 

M = lim M„ . 

n-»oo 

Proof . In the case of a monotonically increasing sequence, apply 
Theorem 3.9 to the mutually orthogonal projectors (take M 0 = {0}) 

— En k _ v k = 1 , 2 ,... 

to deduce that 

lim |(£'m - £m„)/II = lim |(£ M - £ /1| = 0, 

k =1 

where _ 

M =©(M,©M i _ 1 ) = ,0 M t . 

fc=l &=1 

The case of a monotonically decreasing sequence can be reduced to 
the above case by considering the projectors E — 1 — E M „ . Q.E.D. 

*3.5. Appendix: Extensions and Adjoints 
of Closed Linear Operators 

By using the concept of orthogonal complement, we can prove the 
following theorem which then easily yields the generalization of the 
Hellinger-Toeplitz theorem (Theorem 2.10) to any closed operators. 

^Theorem 3*1 L If A is a closed linear operator in Jf? and A * exists, 
then the domain of A* is dense in Jf 7 , and A** = A. 

Proof. If 3t A * is not dense in 3^ y there is a nonzero vector h orthogonal 
to 2f A * , i.e., (h | g) = 0 for all g e &> A * . But then we also have 

<0 | A*g> + <A | -*> = 0. 
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In terms of the inner product | *> 2 in @ Jf, the above relation yields 

(3.21) <(0, A) | (^*£,-£)> 2 = 0 
for all g e 2 A *. 

According to Theorem 2.5, which contains the definition of A*, the set 

(3.22) {(A % -g): ge® A *}Cjr@jr 

is the set of all the points (£*, —g) e satisfying (2.3). This set 

consists of all the points in which satisfy the relation 

(3.23) <(£*> —g) I (/> 4/*)> 2 = 0 

for all fe2 A , i.e., which are orthogonal to the graph of A . The 
graph is a linear subset of because A is linear, and moreover, 

it is closed since A is closed. Thus, the set (3.22) is the orthogonal 
complement G^ of the closed subspace G^ of © Jf. By (3.21), (0, h) is 
orthogonal to G^ and consequently, (0, h) e G A . This implies that 
h = AO, i.e., h = 0, thus establishing that 2 A * is dense in 

Since 2 A * is dense in Jf, (^4*)* = A ** exists by Theorem 2.5. The 
graph of ^1** consists of all the points (/,/*) of Jf 7 © satisfying 

(A*g |/> = <g |/*>, ge@ A *, 

i.e., for which 

(3.24) <(A*g,-g)\(f,f*)} 2 =0. 

This implies that G^** — (G^)- 1 = G^ and consequently G^** = G^ , 
which means that A = A**. Q.E.D. 

Let A be a closed linear operator with 2 A — Jf. By Theorem 2.9 A* 
is a bounded linear operator with 2 A * = , and again by the same 

theorem, A** is also a bounded linear operator with 2 A ** = Jf. On the 
other hand, by Theorem 3.11, ^4** — A, and therefore we can state 
Theorem 3.12. 

Theorem 3*12 (The closed-graph theorem). All closed linear operators 
defined on the entire Hilbert space are bounded. 

It is also easy to deduce from Theorem 3.11 the following useful 
result. 

^Theorem 3*13* The adjoint A* of the linear operator A has a 
dense domain of definition in if and only if A has a closed linear 
extension; in that case ^1** is the smallest closed linear extension of A , 
i.e., any closed linear extension of A is also an extension of A**. 
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Proof . According to Theorem 3.11, if A is closed, then == and 
^4** == ^4, so that Theorem 3.13 stands. 

Conversely, if & A * — then ^4** exists. From the relation 

<A*g |/> = <£ 14/X ge®A*> 

we obviously get ^4** 3 A Moreover, if B is a closed extension of A , 
then its graph is closed in ® and it contains . However, as 
can easily be seen from (3.23) and (3.24), we always have G A ** — (Gi) x - 
This implies that G^** is the closure of G^ , i.e., G^** C G s and, therefore, 
A**CB. Q.E.D. 

Exercises 

3*1 ♦ Let S be a nonempty subset of a Hilbert space . Prove that 
the set S 1 - of all vectors/in Jt which are orthogonal to S 

S ± = {f:<f\g>=0,geS} 

is a closed linear subspace of St*. 

3*2* Use Theorem 3.3 to show that the linear operator A, acting on 
the two-dimensional space / 2 (2), which is represented (see Theorem 1.1) 
in some orthonormal basis of / 2 (2) by the matrix 



is a projector. 

33* Show that any projector E on an w-dimensional Hilbert space Jf? 
can be represented by a matrix of the form 



in a suitably chosen orthonormal basis {e x £ n }, where the number of 
1 *8 in the diagonal (the rank of E) is equal to the dimension of the space 
on which E projects. 
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3*4* Prove that any 2x2 Hermitian matrix either is a multiple 
A E (A = A*) of a matrix E representing a projector on / 2 (2), or it can be 
written uniquely as A 1 E 1 A 2 E 2 , where E 1 and E 2 represent projectors 
on / 2 (2), and X x , A 2 are real numbers. 


4* Isometric and Unitary Transformations 

4.1. Isometric Transformations in between Hilbert Spaces 

We encountered unitary transformations in Chapter I (Definition 2.4) 
when defining the concept of isomorphism of two Euclidean spaces. 
The concept of isometric transformation presents a straightforward 
generalization of the concept of a unitary transformation, and since it 
will be of fundamental significance in scattering theory, we shall define 
it now. 

Definition 4*1 ♦ A linear transformation T of a Hilbert space 
into another Hilbert space 34? 2 (with inner products <• | •>! and <• | *) 2 , 
respectively) which preserves the inner product, i.e., 

<771 Tg> x = <J\g\ 

for all /, g e M P 1 , is called an isometric transformation. If in addition the 
range 34 T of such a transformation is the entire space 34? 2 , i.e., 34 T — 34? 2 , 
then the transformation is said to be unitary. 

We have the following weaker condition of isometry. 

Theorem 4*1* The linear transformation T of into Jf 2 is an 
isometric operator if 

(4.1) lir/|| 2 = ||/||i 

for all / e 34? x . 

Proof. To prove the above statement, first substitute / + g for / in 
(4.1) to derive 

<T{f + g) I T(f + g)\ = || T/|| 2 2 + <7/1 Tg\ + (Tg | Tf\ + || Tg || 2 2 

= (/ + £!/ + £)i 


= II/Hi 2 + </l£>! + is l/>i + 11^ III 2 * 
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then / + ig to obtain 

<T(f + ig) I T(f + ig)\ = || Tf || 2 2 + i(Tf | 7» 2 - i<7* | Tf\ + || Tg || 2 2 
= <f+ig \f + ig>i 

= \\f\\i 2 + i<f\g>i-i<g\f>i + \\g\\i 2 - 

After multiplying the above equation by —i arid adding it to the one 
preceding it, we arrive at the result 

<Jf\ T g>2 = <f\g>i 

for all f,g£ . Q.E.D. 

It is easy to see that an isometric operator , mapping into is 
necessarily unitary if is finite dimensional (see Exercise 4.1). However, 
in the case of linear operators on infinite-dimensional Hilbert spaces these 
two concepts do not coincide. For instance, if {e x , e 2 ,...} is an ortho¬ 
normal basis in the Hilbert space Jf 7 , then the shift operator 

A t f = A t ( J <** | /> 4 = £ <e, | /> e k+l 
\k=1 J k=1 

is obviously isometric, but not unitary, for any fixed / — 1 , 2 ,.... 

The relation that characterizes a unitary operator is U* — £/ -1 , where 
U~ x is the inverse of U. The definition of the inverse T~ x of a linear 
transformation T is contained in the following statement. 

Theorem 4*2* Let T be a linear transformation with domain & T 
and range . The inverse mapping T -1 of & T into Qj t exists if and only 
if the zero vector Oi of 3) T is the only vector mapped by T into the zero 
vector* 0 2 of & T . If T~ x exists, then T~ x is a linear transformation with 
domain = 0t T and range ^ r _i == S) T . 

Proof. For the existence of the inverse transformation T -1 of the 
linear transformation 

(4.2) /-*/' = T/, feB T9 f'e@ T , 

it is necessary and sufficient that f g (i.e., / — g ^ 0 X ) implies 
Tf Tg [i.e., T(f — g) 0 2 ]. The existence of the inverse of (4.2) is 
thus obviously equivalent to the requirement that Tf = 0 2 has only the 
solution/ = 0 X . 


Naturally, 0 a — if and belong to the same vector space. 
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If the inverse transformation T -1 is defined on the linear set , it is 
obviously linear, as can be easily seen in the following way: if we have 
/ = T~ y' and g = T^g', f',g'e 2 T -i = 3% T , it follows that f=Tf 
and g' = Tg, and consequently for any scalars a , b , 

a f' + ¥ = aT f + hT g = T ( a f + b 8)> 


i.e., 

af + bg = T-\af'+W). Q.E.D. 

4.2. Unitary Operators and the Change 
of Orthonormal Basis 

Theorem 43 ♦ A linear operator C/, defined on the entire Hilbert 
space Jff, is unitary if and only if 

(4.3) u*u = C/C/* = 1, 

i.e., if and only if C/ -1 = C/*. 

Proof. A unitary operator U is obviously bounded (|| U\\ = 1) and 
consequently its adjoint U exists and is defined everywhere in Since 

<f\g> = <Uf\U g y = <f\U*U g y 

is true for all /, g e Jf 7 , we conclude that C/* U = 1. Moreover, from 
0t v = ^ we deduce that in the relation 

<Uf\g> = <f\ U*g} = <Uf I C/C/*£> 

C/jf assumes all the values in Jt? when/takes on all the values in This 

implies that UU*g = g for all g e Jf 7 , i.e., C/C/* = 1. 

Conversely, if (4.3) is true for some linear operator C/, we have 
C/ -1 = C/* and therefore 

9t v = 

Furthermore, (4.3) also implies that U is isometric: 

<uf\u g > = <f\u*u g > = <f\g>. 

Thus, we have proved that U is unitary. Q.E.D. 

One of the most important features of unitary operators is that they 
perform the transformation of one orthonormal basis into another. 
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Theorem 4*4* (a) If {e 1 , e 2 is an orthonormal basis in the 

Hilbert space then {Ue 1 , Ue 2 is a ls° an orthonormal basis. 

(b) Conversely, if {e 1 , e 2 and {tf/, e 2 ',...} are two orthonormal bases, 
there is a unique unitary operator U such that e k ' = Ue k , k = 1, 2,... . 

Proof, (a) The set {Ue 1 , Ue 2 is orthonormal, since due to the 
unitarity of U 

<Ue t | Ue f > = <C/*C/*, \ efr = <*< | e>) = . 

To see that {Ue 1 y Ue 2 is also complete, assume that for some/e Ji? 

<f\Ue k y=0, k = 1, 2,... . 

Then we also have 

<U*f\e k y =0, & = 1,2,.... 

The above relations imply that C/*/ = 0, due to the completeness of 
{e 1 , e 2 ,...}. Since C/C/* = 1, 

/ = U(U*f) = 0. 

(b) Given [e 1 ,e 2 ,...} and {g/, £ 2 V--}> we can define an operator U, acting 
on all/e J#* as follows: 

(4.4) Uf=ulf j (e k \fy J = £ <** |/> 

\k =1 / fc=l 

The series on the right-hand side converges in the norm since the 
sequence 

n 

K=Y J <- e k\ />«*'» n = 1 , 2 ,..., 

k=1 

is a Cauchy sequence; this statement is a direct consequence of the 
relation 

m 

\\h m -K\\ 2 = £ l<e* l/>l 2 

ft=W+l 


and the convergence of the series 

f io* i/>i a = ii/ii 2 . 

k-l 
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It is trivial to check that the operator U defined by (4.4) is linear. 
Furthermore, we have for any/, g e 


(4.5) <Uf\ Ug) = lim(£ <e i \f)e/ 

\*=1 3=1 


= E <*!/>**«<«< I *> = </!*>• 

i,j=1 


Hence, the operator U is isometric. From its definition (4.4) it is 
obvious that the range 31 v of U is the entire Hilbert space Jff. Therefore, 
U is a unitary operator. Q.E.D. 


4.3. The Fourier-Plancherel Transform 

The Fourier transform—which is the best known of all integral trans¬ 
forms and is frequently used in quantum mechanics—can be looked upon 
as being essentially the restriction of a unitary operator to some linear 
submanifold of an L 2 ((R n ) space. In order to prove this unitarity property 
of the Fourier transform, we need a few auxiliary lemmas and theorems. 

Lemma 4*1 ( The Riemann-Lebesgue lemma). If f{x) is a complex 
function which is Lebesgue integrable on the interval /, I C [R 1 , then 

(4.6) lim j* f(x) sin Xx dx = lim j* f(x) cos A x dx =0. 

Proof. We shall prove the lemma only for the sine integral, since an 
analogous proof applies to the cosine integral. 

Assume that I is a finite interval, for instance that I = [a, b]. Take a 
function g(x) which is differentiable on [ a , 6], and for which (see 
Exercise 4.2) 

f \f(x)-g(x)\dx < e/2. 

J a 

Integrating by parts we derive 

f g(x) sin A* dx = - M^t^McosXa_ 

l r b 

+ y g'(x) cos Xx dx —► 0 for A —► oo. 

X J a 

Consequently, for sufficiently large values of A, 
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and therefore for such A, 


J /(*) si 


sin Xx dx 


< 


/»b /»b 

£(#) sin A x dx + |/(#) — £(#)| | sin Xx | dx 


< e, 


which proves the theorem. 

The case of infinite I can be reduced to the above case by chosing a and 
b, a < b, so that 

f |/(*)| dx <e. 

J I-ia.b ] 

In that case 

f f(x) sin Xx I < f |/(#)| dx + f f{x) sin Xx dx 

j I \ j J—[a,b] d a 


< 


f(x) sin Xx dx 


-j- e, 


and the previous considerations apply. Q.E.D. 

Lemma 4*2* If f(x) is once continuously differentiable and Lebesgue 
integrable on the interval I in [R n , and if 0 e (R n is an internal point of I, 
then 

/4m .. 1 r r, sin Xx t sin Xx n . v 

(4.7) lim — f(u + x) -•••- d n x = f(u). 

' A-*oo 7 T n J J X ± x n 

Proof. We shall give the proof for the case n — 1. The general case 
can be easily reduced to this case (see Exercise 4.3). 

Since the origin of R 1 is an interior point of /, we can choose a > 0 so 
that [— oc, -f-a] Cl. Then 

(4.8) f f(u + x) S * n - — dx 

J j x 

sin Xx . r sin Xx 


f "f(u+x) S -^dx+ f f(u+xy 

J -a, X J /—[—Q£,+a] 


■ dx. 


X 7^ 0 


where by Lemma 4.1 the second integral approaches zero as A —► + oo. 
The function 

l f(u + x) -/(«) 
h(x) — | x 

\f'(u), x = 0 

is a continuous function of x because f(x) is continuously differentiable. 
Therefore, after writing 

( 4 -9) /_/(“ + *) dx 

C +0L sin Xx , , C +0L f(u + #) —f{u) 


(•+« sin Ax , 

=m )_ a — dx + \. 


- sin Xx dx. 


x 
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we can apply Lemma 4.1 to the second integral on the right-hand side 
of (4.9) to infer that this integral vanishes in the limit A —► oo. Since 


lim 

A-»oo J _ 


X 


we obtain (4.7) by combining (4.8) and (4.9) and then letting A —> oo. 

Q.E.D. 

Theorem 4*5* If f(x) e *€\U m ) is Lebesgue integrable on R m y then 
its Fourier transform* 

(4.10) f(p) - f f(x) er«'*)v» d™x 


is bounded and continuous on [R m , and if D A = [—A, -\-X\ x ••• X 
[-A, +A], 

(4.11) f(x) = (2nfi)- m/2 lim f f(p) d m p. 


If, in addition, f(x ) is of compact support in [R m , then 
(4.12) f J/(*)r*"*= f \f(P)\*d**p. 

J R rn J R m 

Proof. The integral on the right-hand side of (4.10) is uniformly 
convergent in p due to the fact that f(x) is integrable and 

| f(x)e~ {i/h)v ‘ x | ^ |/(#)|. 

Hence, f(p) exists, it is continuous, and by Lemma 4.1 it is bounded. 

In order to derive (4.11), first apply Fubini’s theorem (Chapter II, 
Theorem 3.13) to invert the order of integration, and then Lemma 4.2 
to obtain upon setting x' = /i -1 / 2 #, p f — fi^^p. A' = fr -1 / 2 A, 


(2rrh)~ m/2 \im f f(p)e (i,n)x ' v d m p 
A-^co J Da 

= lim(27r) -m f — f* d m p r f f(y)e i v' ,(x '- y,) d m y r 
A-»oo J _a J _a J 

= lim^-f f{y)^ 

J r» x i 


sin A(V —y/) sin A(x m ' - yj) ^ 


J m 


.. m C c/ \ sin X'u t sin A 'u m _ r/ v 

= lim 7 T~ m fix + u) --- d m u — fix). 

^ \m Ui U m 


The definition found in mathematical literature corresponds to the choice h — I. 
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If f(x) e ^((R m ) then P(p)f(p) is bounded for any polynomial P(p) so 
that/eL^lR” 1 ), and we can apply Fubini’s theorem to obtain 

f I/(*)l 2 d m x = (! 2 f d m x f*(x) f d m pf(p) e {i/n)x ^ 

J u m J u m J u m 

= (27rh)~ m/2 f d m pf(p) f d m xf*(x)e {i/n) * mas 

** U m ^ M m 

= f \f(P)\ 2 d m p. Q.E.D. 

Theorem 4.5 constitutes one of the most important results in the 
theory of the Fourier transform. We are interested, however, in the 
Fourier transform as an operator on a L 2 ((R m ) space. In that context the 
following result is of greatest interest to us. 

Theorem 4*6* There is a unique linear and bounded transformation 
C/ F of L 2 (U m ) into itself, which maps every element / e ^(U m ) into 

/ = U P f, f e «E(R~), fe L 2 (R-), 

where f(p) is related to f(x) by the Fourier transform (4.10); C/ F is a 
unitary operator, called the Fourier-Plancherel transform on U m . 

Proof. The integral operator defined by (4.10) and having as 

a domain of definition is obviously linear. Since ^(U n ) is dense inL 2 (lR m ) 
(Chapter II, Theorem 5.6), and since by the relation (4.12) in 
Theorem 4.5 the present operator is isometric on ^(!R m ), it has according 
to Theorem 2.6 a unique bounded linear extension to L 2 ((R m ). This 
extension U F is isometric (see Exercise 4.4). 

In order to prove that C/ F is unitary, i.e., that 0t v — L 2 (U m ), it is 
sufficient to show that it maps a dense set onto another dense set. This 
last property follows from Theorem 4.5; namely, we easily see from 
(4.10) and (4.11) that the dense set consisting of all of functions f(x) e 
^°°(lR m ) for which xl 1 ••• x]% djj ••• d^f(x) is bounded on R m for any 
H >•••, i m , ji >—>j m = 1> 2,... is mapped onto itself by U F . Q.E.D. 

*4.4. Cayley Transforms of Symmetric Operators 

An important category of isometric operators are the Cayley trans¬ 
forms of symmetric operators A, 

(4.13) V = (A — i)(A +i)~\ 

which play a key role in the theory of self-adjoint operators—as we shall 
see later in this chapter. 
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^Theorem 4*7* If A is a ( closed) symmetric operator in a Hilbert 
space ^f 7 , its Cayley transform V — (A — i)(A + i)~ x exists as a linear 
operator in Jf 7 , and it is an isometric transformation of the closed 
subspace 3$ A+i onto the (closed) subspace 2t A _ i . 

Proof. The existence of the operators {A ± i)~ x follows from the 
relation 

(4.14) || (A ± f)/ll* = <4/ 1 4f> T Kf I Af> ± i<Af\f> +</|/> 

= M/ll 2 + ll/ll 2 , fe9 A , 

and Theorem 4.2, since according to the above relation (A ± i)f — 0 
implies / = 0. 

We shall prove now that the ranges 3% A±i of the two operators A ± i 
are closed linear subspaces of M* if A is closed. 

Since 3i A±i are the ranges of linear operators, they are linear. To show 
that 31 A+i and 3i A _ i are closed sets, we first derive from (4.14) that 

||/|| < 11(^4 ± 0 / 11 , fe# A . 

Thus, if the sequence // — (A -f- 0/i y/ 2 ' — (A + i)f 2 ,... converges to 
some/' e Jf 7 , the sequence/! y f 2 ,... also converges to some/e . Since 
A -f- i is obviously closed when A is closed (see Definition 2.4), we must 
ha ve/e 2 A and/' e 3$ AJhi . 

A similar argument establishes that 3% A __ i is closed too. 

The operator (A + z) _1 is defined on 3& A+i and its range is 2 A , while 
A — i is defined on 2 A and its range is 3%^ . Consequently, 
V — (A — i)(A + i)~ x maps 2t A+i onto 3& A _ i . To establish that V is 
isometric, note that (4.14) yields 

IK- 4 — £)g\\ = IK" 4 + *)g\\> • 

If we insert above g = (A + z) -1 /, f G ^a+% > we g et 

\\{A-i){A +0-VII = 11/11 
for all/e 3i A+i , i.e., V is isometric. Q.E.D. 

In the above Theorem the term “closed” is optional, but it should 
be noted that each densely defined symmetric operator A has at least 
one closed extension, namely ^1**; we see immediately that A** Z) A 
by looking at the relation 

<A*f\g> = </| Ag\ fe@ A *, 

and recalling that since 2 A *D2 A and 2 A is dense in 2 A * itself is 
dense in and therefore^** exists and is closed (see Theorem 2.8). 
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Moreover, as we shall see later, A** C A*, and since obviously 
A* C A***, it follows that A** is symmetric. 

^Theorem 4*8* If the Cayley transform V of the linear operator A 
exists, then 3 A — 3t x _ v and 

(4.15) A =i( 1 + V)(l - V)~\ 

Proof. According to Definition (4.13) of V , 3 V consists of all vectors 
g for which there is an f e 3 A such that 

(4.16) g = (A + i)f. 

When V is applied to the above g> we get 

(4.17) Vg = (A — i)f. 

By adding and then subtracting (4.16) and (4.17) we obtain 

(4.18) (1 + V)g = 2Af, (1 - V)g = 2 if 

The second of the above relations shows that (1 — V)~ x exists; namely, if 
(1 — V)g = 0, the aforementioned relation implies that / = 0, which 
due to (4.16) implies in its turn that g = 0, thus establishing, in 
accordance with Theorem 4.2, the existence of (1 — F')~ 1 . Hence, we 
can write 

g = 2*(1 - V)-'f, 

which in conjunction with the first of the relations (4.18) yields 

Af=i(\ + V)(\-V)~'f fe3 A . 

Moreover, from the second of the relations (4.18) we see that 
^(i-v)- 1 = ® A > so that the domains of definition of A and 
*(1 + V)(l — V)- 1 are identical, i.e., (4.15) is true. Q.E.D. 

The following theorem is of great practical importance in establishing 
the self-adjointness of a symmetric operator. 

^Theorem 4*9* The symmetric and densely defined operator A is 
self-adjoint if and only if its Cayley transform is a unitary operator. 

Proof ’ Assume that A = A*. We shall prove that the domain of 
definition & A+i of V is dense in 

Assume that h , i.e., that (Jt | {A + i)/> = 0 for all fe 3 A , 

or, equivalently, that (h | (A + 0/) = <0 |/> for all/ g 3 A . According 
to Theorem 2.5 we have h e 3( A+i )* and 

(A + *)* h = (A* - i) h = (A — i) h = 0. 
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It follows from (4.14) that (A i i)h = 0 only if h = 0. Thus, R A+i is 
dense in 34?, and since 34 A+i is closed (see Theorem 4.7), we must have 

Sty = & A +i = 34? • 

It can be proved in a completely analogous manner that 
0l v = 8% A _i = The identities @1 A±i = 34? establish, in conjunction 
with Theorem 4.7, that V is unitary when A — A*. 

Assume now that V is unitary and A* exists. We shall prove that in 
this case A = A*, where A* exists since 2 is dense in 3?. 

Take g e 2 A * and set g* = A*g. Then 

<g\A/y = <g*\fy 

for all/e 2 A . According to (4.15), for each/e 2 A there is an h e 2 V = 34? 
such that / == (1 — V)h and Af — i{ 1 + V)h. Hence 

Kg\(i + v)hy = <g*\(i-V)hy 


for all h e Jf 7 , or 

Kg I h > + Kg I Vh > = <g * I h > ~ <g * I Vh>. 

Due to the unitarity of V, 

<g I hy = < Vg I Vhy and <£* | h) = < Vg * I Vhy 
and the preceding relation yields 

<— ig — iVg — Vg* + g* | Vhy = 0 

for all h e 34?. Since V is unitary, Vh assumes all the values in 34? when h 
takes on all the values in 3?, and therefore 

*’(1 + V)g - (1 - V)g* = 0. 

Using the above relation we obtain 

which shows that g G 0Z (1 _ V) = 2 A ; the same relation yields 

*xi + — = s*, 

which shows that g* = Ag. Thus, A* C A, and since A is symmetric, 
A * = A. Q.E.D. 
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We have already seen in Theorem 4.7 that Si A+i and Si ^ are closed 
linear subspaces of when A is a closed symmetric operator. Let us 
denote by m and n the dimensions of the ortogonal complements of 
Si A _ t and Si A+i , respectively, 

m — dim Si A _ { , n — dim &&A+i * 


Naturally, m and n can assume any nonnegative integer values, including 
+ oo when the corresponding space is infinite dimensional. Moreover, 
if is not separable, we have to distinguish between infinite dimensions 
of separable and nonseparable subspaces. 

The two numbers m and n are called the deficiency indices of A , and the 
two subspaces Si^_ t and SH^ +i are the deficiency subspaces of A. According 
to Theorem 4.9, A is self-adjoint if and only if m = n — 0. 

If the deficiency indices of A are equal but not zero, then we can easily 
extend the Cayley transform V of A to a unitary operator V 1 on This 
can be achieved by choosing any two orthonormal bases {e x \ e 2 ^..} anc ^ 
{^i", £ 2 ",...} in Si A _i and S$ A+i , and extending V to Si A+i by writing 
Vtfk' = e k '. Since this extended J+ is unitary, the operator 

Ai = i(l + ¥,)(! - V,)-' 

is self-adjoint by Theorem 4.9, and it obviously provides an extension 
of A. Hence, if m — n > 0, the closed symmetric operator A will have 
many self-adjoint extensions. If, however, m + w, then A has no self- 
adjoint extensions. As a matter of fact, if A 0 were a self-adjoint extension 
of A, then the Cayley transform V 0 of Aq would be unitary, and it would 
be an extension of the Cayley transform V of A. Since obviously V 
maps ^t A _ t isometrically onto Si A+i , this would imply that m — n, 
which contradicts the assumption. In summary, we can state the 
following theorem (see also Lemma 7.1 in Chapter IV). 

Theorem 4* 10* A closed symmetric operator A has self-adjoint 
extensions if and only if its deficiency indices m = dim Si A _i and 
n = dim Si^ +i are equal; moreover, A = A* if and only if m = n = 0. 

Theorems 4.7-4.10, and in particular Theorem 4.10, indicate that the 
self-adjointness or essential self-adjointness of a Hermitian operator in an 
infinite-dimensional Hilbert space is by no means a “natural” feature of 
such an operator. On the contrary, there are many more symmetric 
operators which have no self-adjoint extensions than symmetric operators 
which possess self-adjoint extensions; namely, to any isometry which is 
not the restriction of a unitary operator there corresponds a closed 
symmetric operator without self-adjoint extensions. The reader should 
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keep this in mind when trying to decide within the context of some 
quantum mechanical theory whether or not some obviously Hermitian 
differential operator represents a self-adjoint operator. 

4.5. Self-Adjointness of Position 

and Momentum Operators in Wave Mechanics 

As an easy application of the results obtained in this section, we can 
prove the self-adjointness of the /-coordinate position operator Qj and the 
canonically conjugate momentum operator Pj . 

We define Qj as the operator with domain 3 Qj consisting of elements/ 
of L 2 ((R m ), representable by functions f(x 1 ,..., x m ) which are such that 
Xjf(x i,..., x m ) are Lebesgue square integrable, and acting on fe S) Qi in 
the following way: 

(4.19) (Qjf)( X 1 >•••> x m) ~ x jf ( x 1 >•••> X m)9 f G ®Qj • 

The above operator is obviously symmetric (see Exercise 4.5). 

To prove that Qj* = Qj , we will show that the Cayley transform 
(Qj ~~ i)(Qj + 0 -1 unitary (see Theorem 4.9). 

The ranges of Qj i * are L 2 (U m ); e.g., if g eL 2 (U m ), we can write 

g( x i*m) 

h( x l vj X m) 

where obviously h e 3 Q . Moreover, the operator (Qj — i)(Q 3 - + i) _1 is 
isometric, 

11(0,- - i)(Q, + = f IVTT T l/(*i >•••• ^)l 2 dm * 

J R \ x j ~T 1 I 

l/(*l,...,*m)| 2 ^ = ll/l| 2 

and since its domain and range are (see Theorem 4.7) & Q . +i and 3% Qj -i , 
respectively, this operator is unitary. Hence, Qj is self-adjoint. 

We define Pj by 

(4.20) P, = U?Q f U F , = U ?@ Qi , 

where U F is the Fourier-Plancherel transform introduced in Theorem 
4.6; in (4.20) we used a very useful notation, according to which if T is 
any transformation mapping the set S 1 C 3 T onto S 2 we write 



— ( x j “I - 0 h(x i ,..., x m ), 

= g(x 1 X m ) 

Xj -|- i 


( 4 . 21 ) 


S 2 = TS X = {Tf-.f e SJ. 
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The self-adjointness of Pj follows from the unitarity of U F and the 
following lemma. 

Lemma 4 3* If A is a self-adjoint operator in Jf 7 , and U is a unitary 
operator on Jf 7 , then the operator 

B = UAU~\ = U9 a 

is self-adjoint. 

Proof. Since obviously because (see Exercise 

4.6), it follows that jB* exists. We must have B* D UA^U -1 — B due 
to the following relation: 

<f\B g y = </| UA*u-' g y = <t/-y| 

= t/-y i i g> = < 5 /1 *>, /, * e . 

By inverting the roles of A and B , and noting that C/ _1 is unitary, we get 
A*D U~ 1 B*U, which implies that B* C UA^U -1 = UAU -1 = B. 
Hence, B = B*. Q.E.D. 

As a result of the above considerations we can state Theorem 4.11. 

Theorem 4*1 L The wave mechanical canonical position and 
momentum operators Qj and Pj , defined by (4.19) and (4.20), are self- 
adjoint. 

In order to see that (4.20) is nothing but a precise definition of the 
symbol —i^djdx^ note that for/belonging to the set [which 

according to Exercise 4.6 is dense in L 2 ([R m ) since ^J(R m ) is dense] we 
have 

PJ = U?QJ, 

where 

(Qjf)(Pl 9-9 Pm) = PofiPl 9 -, Pm) = g(pl 9-9 Pm)- 

Hence 

(Pjf)( x 1 >•••>■ x m) ~ (^p^X^l »•••» 

= {lirhy^ f pjfa d™p 

= —*(2wS) _m/a S- f /(/>! ,...,p m ) e {imv ’ x d m p 

0X 3 J 05« 

^f( x l ’•••’ x m) 

= dx } " ’ 
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where the interchanging of the order of differentiation and integration 
in carrying out the third step in the above derivation is easily deducible 
from standard theorems of the calculus, when the fact that / e ^(lR m ) is 
taken into account. 

Exercises 

4*1 ♦ Prove that an isometric operator V defined on a finite¬ 
dimensional Hilbert space is necessarily unitary. 

4*2* Prove that iif(x) is Lebesgue integrable on [a, b], there is for 
any given e > 0 an infinitely many times differentiable function 
g(x) e 6]) such that 

f \f(x)-g(x)\dx < e. 

J a 

4*3* Prove in detail Lemma 4.2 for any integer n. 

4*4* Prove that if A and B are bounded linear operators satisfying 
A C B, and if 3 A is dense in 3 B , then B is isometric when A is isometric. 

4*5* Show in detail that the operator Qj defined in the subsection 4.5 
[see (4.19)] is linear and Hermitian, and that Qj *2 Qj. 

4*6* Show that under a unitary transformation the image of a dense 
set is also a dense set. 


5* Spectral Measures 

5.1. The Point Spectrum of a Self-Adjoint Operator 

We encountered the eigenvalue problem when determining the 
bound-energy spectrum of a quantum mechanical system from the 
wave mechanics point of view. In this section we shall formulate the 
eigenvalue problem for symmetric operators in general. 

Definition 5*1 ♦ A complex number A is said to be an eigenvalue 
(<characteristic value ) of the linear operator A, with domain of definition 
3 A , if there is at least one nonzero vector/ e & A satisfying the eigenvalue 
equation 

(5.1) Af = A/. 

Each vector / e 3) A satisfying the above equation is called an eigenvector 
(or characteristic vector ) of A with eigenvalue A. All the eigenvectors with 



5♦ Spectral Measures 


227 


the same eigenvalue A form a linear subspace M A , which is usually 
referred to as the characteristic subspace corresponding to the eigenvalue A. 

If M A is infinite dimensional, then M A is not necessarily closed. 
However, it is easy to see that the characteristic subspaces of closed 
linear operators are always closed. 

Theorem 5*1* If the linear operator A is symmetric, all its eigen¬ 
values are real and the characteristic subspaces corresponding to different 
eigenvalues are mutually orthogonal. 

Specialized versions of Theorem 5.1 were proved in Chapter I 
(Theorem 5.1) and Chapter II (Theorem 5.3). The proof in the general 
case proceeds along the same lines. 

To derive that an eigenvalue A is real if A is symmetric, choose any 
nonzero eigenvector / and use its symmetry property to write 

A</l/> = </l A/> = </| Af> = (Af\fy = <A/|/> = A*</|/>, 

thus showing that A = A*. 

The orthogonality of two characteristic subspaces M Aj and M Ajj , 
corresponding to two different eigenvalues A x and A 2 , follows from the 
orthogonality of any two respective eigenvectors e M Ai and / 2 e M Ajj : 

Ai</i I /»> = <Ai/i |/ 2 > = <4/i I /.) - </i I 4/2> = </i I A 2 / 2 > = A 2 (fi |/ 2 >. 

The set Sp of all eigenvalues of an operator A is called its point spectrum 
(see also Definition 3.3 in Chapter V). 

5.2. Spectral Resolution of Self-Adjoint Operators 
with Pure Point Spectrum 

Consider a closed symmetric operator A for which the closed linear 
space spanned by its eigenvectors is the entire Hilbert space Jf 7 (an 
operator with a so-called pure point spectrum ), i.e., for which, by 
Definition 3.1, 

^=©M A) 

AgS p 

where Sp denotes the point spectrum of A. We assume that Jf 7 is 
separable, so that Sp is necessarily at most countable, and hence it can 
be written in the form 

= {Ai > A 2 ,...}. 

By using Theorem 3.9 we get 

£ E Hx /=E Jt f=f. 
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If we have for s ime / 

(5.2) £ I HI 2W/II 2 < +oo, 

*=i 

then the sequence,/ 2 

n 

(5-3) /» = E W 

fc=l 


converges to/, while , 4/2 

(5.4) Af n = £ ^(£ Ma /) = £ A k (E M f) 

k=1 k =1 

is a Cauchy sequence, due to (5.2). Since is complete, this sequence 
has a limit g. Therefore, A being closed, we must have that f e 2 A and 

Af = g=Z\ k (E M f). 

k=i k 

Vice versa, if/e 2 A , (5.2) must be satisfied. To see that, note that the 
restriction of A to M Ai ® ••• ® M A ^ is a bounded operator and therefore, 
A being closed, it must be defined everywhere on M Ai ® • • • © M A ^ . 
Hence, the vector f n , defined in (5.3), belongs to 2 A . Moreover, A 
obviously maps M Ai © • • • © M A ^ into itself, and therefore 


<- Af n | A(f-f n )y = (A% I f-f n } = o. 


By using this relation we immediately get 

II Af || 2 = || A(f-f n )|| 2 + || Af n ||» > || Af n || 2 , 

which shows, in conjunction with (5.4), that (5.2) must be satisfied if 

The above arguments can be applied to A*, A* D A, to derive that 
2 A * consists of all vectors/satisfying (5.2), so that 2 A * — 2 A . Hence, 
we have proved Theorem 5.2. 

Theorem 5*2* If the closed linear manifold spanned by all the 
eigenvectors of a closed symmetric operator A acting on a separable 
Hilbert space Jf 7 is identical with Jf 7 , then A is self-adjoint, and 2 A 
consists of a the vectors/ satisfying (5.2). 
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In the considered case, the complete solution of the eigenvalue problem 
is accomplished when the characteristic subspaces M Ai , M Ag are found, 
or equivalently, when the projectors E M , E M ,... have been specified. 

Al ^2 

However, this formulation of the eigenvalue problem is not sufficiently 
general, because it does not cover the cases of self-adjoint operators 
whose eigenvectors do not span (such as the general case of the 
Schroedinger operator), or do not even have any eigenvalues at all 
corresponding to Hilbert space eigenvectors (such as the position and 
momentum operators in quantum mechanics). Fortunately, one can 
solve the most general case of self-adjoint operators after rephrasing the 
above offered solution of the eigenvalue problem for operators A with 
pure point spectrum in the following way: the knowledge of the projectors 
E m , E m ,... on the characteristic subspaces M A , M A ,... of A is 

Ai M 2 1 2 

equivalent to the knowledge of the projector-valued set function 

(5.5) E(B)= X 5e#, 

Aei?nSp 

for all Borel sets B on the real line. The set function E(B ), B e 0S 1 , whose 
assumed values are projectors, has certain properties which are common 
to a class of projector-valued set functions which are known as spectral 
measures. Moreover, these properties, which we shall formulate and 
study later in this section, make E(B) unique in the sense of E(B) being 
the only spectral measure that can be associated with A. When it is thus 
generalized, the eigenvalue problem becomes the problem of deter¬ 
mining the spectral measure of A, and as such it can be applied to any 
self-adjoint operator—as we shall see in §6. 

5.3. Weak, Strong, and Uniform Operator Limits 

The above approach to the problem necessitates a generalization of the 
concept of measure which would also apply to operator-valued set 
functions. A careful analysis of the concept of limit of sequences of 
operators is essential for that. As seen in the following definition, we can 
distinguish three main concepts of limits for operator-valued functions. 

Definition 5*2* Let A(t) denote an operator-valued function on 
some subset S of R 1 , assigning to each t e S a bounded operator A(t) 
acting on a Hilbert space . We can define A to be a limit of A(t) when 
t tends to t 0 (where £ 0 is a real number or ±°o) in the following three 
ways: 

(a) If for all /, g e 

(5.6) </l Ag) = lim </| A(t)g), 
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A is 

the weak limit of A(t) when t 

, and we write 


> A = 

w-lim Ait) 

t->t 0 

or 


(b) If for all/e-T 




(5.7) 


lim \\(A(t) 

~A)f\\ 

= o, 

A is 

the strong limit of A(t) for t - 

h '■ 



A = 

- s-lim Ait) 
t-*to 

or 

A (^ A - 


(c) Finally,' if || A — ^4(£)|| —► 0 when t —► t 0 , we call A the uniform 
limit of A(t) and write 

A — u-lim Ait) or Ait) => A. 

t-*t 0 t-±t 0 

These three definitions of limits are related in the sense that a uniform 
limit is also a strong limit, and a strong limit is also a weak limit, though 
the converses of these statements are by no means true. Thus, from the 
inequality 

\\(A — A(t))f\\ < M — A(t)\\ ll/ll 

we derive that when A — u-lim^ A(t), then (5.7) is also true. By 
means of the Schwarz-Cauchy inequality we get 

\<f\(A-A(t))g>\ < ||/|| \\(A -A(t))g\\, 

which shows that (5.7) implies (5.6). 

As an example of strong convergence which does not imply uniform 
convergence, consider a sequence E 1 , E 2 ,... of mutually orthogonal 
nonzero projectors. According to Theorem 3.9, there is such a projector 
E that 

n 

E = s-lim V E k . 

"-* 00 ti 

However, the sequence E {1 \ 2? (2) ,..., 

» = 1 , 2 ,..., 

k =1 

is not uniformly convergent, since it does not satisfy the necessary 
condition (see Exercise 5.3) of uniform convergence: 

|| E in+ 1) — E in) || -► 0 as n~+ oo; 
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in fact, since E n+1 ^ 0, we have 

|| £<»+!> _£(»»|| = || E n+1 || = 1. 

Definition 5.2 suggests that there are two different kinds of convergence 
of sequences of vectors in a Euclidean space. 

Definition 5*3* If/(t) is a vector-valued function assigning a vector 
f(t) from the Euclidean space $ to each value t from the set S, S C U 1 , 
we call/a weak limit oif(t) for t -> t 0 , 

/ = w-lim/(<) or f(t) —-/ 

*'~**'0 

if (g /) = lim (W (g f (t)) for all g e we say that / is a strong limit 
of/ (t) for t —»-1 0 , 

/ = s -lim/(0 or /(<) —>/, 

if lim^* o ||/-/(OII = 0. 

From the inequality 

K*l/-/(0>l <11* II11/-/(OIL 

we conclude that if/is a strong limit of/(£), then it is also a weak limit. 
The converse is by no means generally true. 

It has to be emphasised that we have been considering until now only 
limits in the norm for sequences of vectors. The concept of the limit 
in the norm is obviously identical with the concept of strong limit. 

5.4. Spectral Measures and Complex Measures 

We are now ready to define the concept of a spectral measure. The 
reader should review Definition 2.1 in Chapter II of an ordinary measure, 
and note the formal analogies. 

Definition 5*4* A spectral measure E(S) on the measurable space 
(SF, s/) is a projector-valued function which assigns a projector E(S) to 
each element S of the Boolean or algebra s/ in such a manner that 

( 1 ) £(#) = 1 ; 

/ 00 \ 00 

(2) E I (J S k \ = £ E(S k ), if S x , S 2 ,..., is any sequence of disjoint sets 

\Jfc-l J k=1 

from «£/. 

In the last relation above, we define 

oo n 

£ K(S\) = s-lim £ E(S k ). 

/<!«*" J 
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We agree, in general, that given an infinite sequence E x , E 2 ,... of 
projectors, the sum of their infinite series shall always mean, if not 
otherwise stated, the strong limit of the partial sums: 


X E k 

k =1 


s-lim f E k . 


If we choose in the relation 

(5.8) e(\J = £ E(S k ), St n S } = 0 for i * j, 

\k=l / *=1 

S 2 — S 3 — ••*.= 0 we get E(0) = 0. By taking above instead just 
S 8 = S 4 = — = 0, we obtain E(S 1 u S 2 ) = E(SJ + E(S 2 ) if S 1 n 
S 2 — 0 , so that £(S 1 )£ , (S 2 ) = 0 by Theorem 3.7. More generally, 
setting B = R n S, S 1 = R — B, S 2 = S — B for R, S e ja/, we get 
E(R)E(S) = [EiSJ + E(B)][E(S 2 ) + E(B)] = E\B) = E(B) since 
Sj n S 2 = Sj n B — S 2 n B = 0 . Hence we can state the following 
theorem. 


Theorem 5.3. If E(S) is a spectral measure on (^, stf), then 
E( 0) = 0 and E(R)E(S) = E(R n S) whenever R, S e sf. 

The concept of a spectral measure can be related to that of a measure 
most conveniently by introducing the concept of a complex measure. 
For our purposes the following definition of a complex measure is most 
adequate, though it is not the customary one. 

Definition 5.5. A complex measure /x( S ) on the measurable space 
(^, jaf) is a finite linear combination, with complex coefficients c x ,..., c v 

tA s ) = X c ^( s ) 

a=l 

of a given number v of yimte measures /x^S),..., /x^S) on (^, ja/). A 
measurable function /(£), £ e is integrable on Res/ with respect to 
jji if and only if it is integrable on R with respect to each of the measures 
/xj ,..., [jl v ; in that case, by definition, 

(5.9) f mdK£)=tc a f /(f)d^(0- 

Jr a=i J R 

With the above definition of integration most of the results of 
Chapter II, §3 on integration can be easily generalized directly to complex 
measures. Moreover, it is easy to see that the above integral (5.9) is 
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independent of the way the complex measure /x is written as a sum of 
measures. That is, if 

(5.10) MS) = £ C MS) = £ c e '^'(S), 

a—1 0=1 

any function /(f) integrable on R with respect to /xj /jl v is also 
integrable with respect to /x/,..., /x'/ and 

(5.11) f f mdM = £c s ' f mdtoW- 

a=l J R 0-1 J R 

The simplest way to verify (5.11) is by noting that for/(f) = Xs(£)> 
SCR, (5.11) becomes (5.10) and is therefore certainly true. 
Consequently, (5.11) is also true when /(f) is a simple function. 
Following the usual procedure, indicate^ by Definitions 3.4 and 3.5 of 
Chapter II, one can arrive at the verification of (5.11) for the most 
general case when/(f) is an extended complex-valued function on R. 

The reader can easily convince himself that a complex measure is a 
complex-valued or-additive set function on the measurable space {%, stf). 
It is worthwhile to mention that due to a fundamental theorem of 
measure theory* the converse is also true: any finite complex-valued 
or-additive set function on a measurable space {$£, jaf) is a complex 
measure (see Theorem 5.8). 

^Theorem 5*4* A projector-valued set function E(S), S e stf, 
defined on the measurable space (iT, ja/) and such that E(3£) = 1, is a 
spectral measure if and only if for any two given vectors /, g 

(5.12) Pt.o(S) = <f\E(S)g\ S ejtf, 
is a complex measure on {9C , j/). 

Proof. From the easily verifiable relation 

(5.13) </1 E(S) g> = L <f + ig \ E(S)(f + ig)y + i </ + g I E(S)(f + *)> 

- ^ «/1 E(S)f> + <* | E(S) g >) 

containing on the right-hand side only set functions of the form 
(h | E(S)hy, which are obviously measures, we immediately deduce that 
}if' U (S) is a complex measure. 


# Iitihn’s theorem: sec Theorem 5.7 in Appendix 5.6. 
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Assume now that, conversely, (5.12) is a complex measure for any two 
given /, g e 3^. This implies that /x f g (S ) is or additive, i.e., for any 
sequence S 1 , S 2 ,... of disjoint sets from s/ 

(5.14) <f\E(S 1 u S 2 u ••■)*> = u S 2 u •••) 

= i = i <f\E(s k ) g > 

k =1 fc=1 

= lim </|(^(5i) + -+^(5 n ))^> 


By taking in the above relation S 3 = S 4 = • • • = 0, we derive 
£(Si U S 2 ) = £(£,) + £(S 2 ) 

for any two disjoint S ± , *S 2 e Jaf. This implies, according to Theorem 3.7, 
that 

E(SJE(SJ=0 

for any disjoint sets S x , S 2 e j/. Thus, when, in general, the sets in 
S 1 , £ 2 ,... e ja/ are disjoint, 

E £(S«) = s-lim £ E(S k ) 

71-4 00 r™' 

n=l fc=l 

exists by Theorem 3.9, and due to (5.14) we have 

f;£(5 n ) = £(u s\ 

n =1 \n=l / 

This establishes that E(S) is a spectral measure. Q.E.D. 

In defining the concept of a spectral measure E(S) we required that S 
should assume values from a Boolean or algebra instead of an algebra. This 
requirement does not entail any loss of generality, as can be seen from the 
following analogue of Theorem 2.3 in Chapter II. 

^Theorem 5*5* Let E 0 (S) be a projector-valued set function 
defined on a Boolean algebra of subsets of a set SC. Suppose that 
E 0 (3C) = 1 and 



for any sequence S 1 , S 2 ,... of disjoint sets from , for which 
uLi 5, e . In that case there is a unique spectral measure E(S), 
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S e si, defined on the Boolean or algebra si generated by si^ , which is an 
extension of E 0 (S), i.e., E 0 (S) = E(S) for all 

Proof. To prove the theorem, we shall follow the lead suggested by 
the proof of Theorem 2.3 in Chapter II, and introduce for each subset R 
of 9£ the equivalent of an infimum of the family of all projectors 

OO 00 

(5.15) £ E 0 (S k ), \J S k DR, S k es/ 0 , S ( n S k = 0 for i # k. 

k=1 k=1 

This “infimum” E(R) can be defined as the projector onto the closed 
subspace M (R) which is the intersection of all subspaces on which the 
projectors (5.15) project (see Exercise 5.7). In this way, we have defined a 
projector-valued set function on the family of all subsets R of S£, which 
obviously has the property that 

</1 E(R)f> = inf j//| £ E 0 (S k )f): Q S k D R, S k s^J. 

According to Chapter II, Theorem 2.3, </1 E(S)f} y S e si, is a measure 
on si. Hence, as we see from (5.13), it follows that </1 E(S)g ), S e si, is 
a complex measure on si for any vectors / and g. Thus, E(S ), S e si, is 
by Theorem 5.4 a projector-valued measure on si, which evidently 
coincides with E 0 (S) for S e si 0 . Q.E.D. 

5.5. Spectral Functions 

A concept very closely related to that of spectral measures, and also 
very useful when formulating the spectral theorem in § 6 , is that of 
spectral function. We shall give a definition of spectral function in (R n , 
rather than limiting ourselves to the more conventional special case of 
n = 1. 

Definition 5*6* A projector-valued function E x = E Xi . Xn on [R n 

is called a spectral function (or spectral family, or decomposition [; resolu¬ 
tion ] of the identity) if it satisfies the following requirements: 

( 1 ) E x ^ Ey whenever A ^ A', 

(2) E h = E k+0 = s-lim E y , 

A -»A 
A'>A 

(3) E_oo = s-lim E x = 0 and E,^ = s-lim E x = 1, 

A-»—oo A->+°° 

where A' ^ A for A = (AjA n ) and A' = (A/,..., A^') means that 
A, < A,',..., A n < A,„'. 
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^Theorem 5*6* To every spectral measure E(B), on 

(IR n , 38 n ) corresponds a unique spectral function E x , A e IR n , satisfying 

E^,..., a„ = E(I h A J, 

(5 ' 16) K .*. = (-«, AJ X - X (- 00 , AJ, 

and, conversely, to each spectral function on U n corresponds one and 
only one spectral measure on ([R n , 38 n ) satisfying (5.16). 

Proof. If E(B), B g 38 n y is a spectral measure, the projector-valued 
function E h , A e [R n , defined by (5.16) satisfies the first requirement of 
Definition 5.6 since for A' ^ A, 

Ey = E(Iy) = E(I a ) + E(Iy - J A ) > E(I X ) = E A , 

while the other two requirements of Definition 5.5 are satisfied by E x due 
to the continuity from above of the spectral measure E(B) (see Exercise 
5.8); for instance, 

s-lwn E y = s-lim E(I X ) = E{I X ) = £ A . 

A'-»A A'-»A 

To prove the converse, define a projector-valued set function E(B) on 
the Borel algebra 38 q in the following way. For intervals / A defined in 
(5.16), the set function E(I X ) is defined by (5.16). Assuming now that the 
E(B) is defined for two sets B 1 and B 2 , write 

E(B X u B 2 ) = E(B 1 ) + E(B 2 ) if ^0^=0, 

E(B, n B 2 ) = E(B 1 ) E(B 2 ), 

E(B ') = 1 - EiBJ. 

The reader can easily convince himself that by the above indicated 
operations one can extend the domain of definition of E(B) from the 
intervals I x to the family J n of all intervals in IR n , and then to 38 q, an d 
that the derived function E(B ), B g 38 q, is or additive on 38 Since 

E(U*) = s-lim E(I h _ Xn ) = 1, 

E(B), B g 38q, satisfies the condition of Theorem 5.5. Therefore, it has 
a unique extension to 3B n , and this extension is a spectral measure. 

Q.E.D. 

*5.6 Appendix: Signed Measures 

The real and imaginary parts of a complex measure are cr additive but, 
in general, not nonnegative set functions. These set functions belong to 
the class of signed measures. 
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Definition 5*7* A countably additive set function /x(£) on a 
measurable space (^, j/), for which /x( 0) = 0, is called a signed measure 
if it assumes values only from the set [R 1 U {-f-oo} or only from the set 
(R 1 U {— oo}. 

The reason for not allowing a signed measure to assume + oo as well 
as — oo values is to avoid the ambiguities inherent in the addition of + oo 
and — oo. 

We recall that, in general, a measure was not a special case of a complex 
measure since a complex measure is, by definition, finite. Hence, only 
the class of finite measures is a subset of the family of complex measures. 
However, we note that all measures are certainly special cases of signed 
measures. 

One could ask the question whether a signed measure could be 
always written as a linear combination of measures. The following 
theorem provides an affirmative answer to this question. 

^Theorem 5*7 {Hahn's theorem). If /x is a signed measure on the 
measurable space {SC^ <$/), then there are two measurable disjoint sets 
S {+) and S { ~ ] whose union is 2£ y such that /x(£ n £ (+) ) ^ 0 and 
fi{S n S<->) < 0 for all Sesf. 

Proof. It is sufficient to consider only the case when fi(S) assumes 
values only from [R 1 U {-f-oo}. 

Denote by ^ (-) the family of all sets R such that fi{R nS)^0 for 
all S e sf. If 

( 5 - 17 ) V= R 3? t -A R )> 

choose a sequence R x , R 2 ,... e ^ (-) such that 

The sets S k = Ui=i also belong to #' (_) since 

tis ns k ) = £n(sn (R, - U *,)) = X M (tf, n S n (fi */)) < 0 

i =1 j=1 i =1 j=l 

for all S G stf. Moreover, we also have 

KS k ) = /* (u - u *}) = t ** - U 

\i=l \ j=i+l / / i=l \ j=i +1 / 


— fi{R k ) + ^ fi fa ^ (n ^ 

i~i \ \j~i- i-i // 
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and consequently 
The set 

s<->- 0 *=G* 

&=1 &=1 

also belongs to jF (-) since /x(S n S { ~ ] ) ^ 0 for any S gj/, as can be 
seen from the relation 

p(S n S<->) = f MS n (S * +1 - S k )) = lim MS n S,). 

fc=l 

Setting S' = 2E y we also get 

MS'-’) = ^lim M^s) = r- 

We claim now that fi(S n S (+) ) ^ 0 for all Sgj/ if S (+) — 2£ — iS (_) . 

Assume to the contrary that /x(S n S (+) ) < 0 for some S e The 
set S { q +) = Sn S (+) could not belong to ^ (-) , since then we would 
have $ ( 0 +) U S { ~ ] e J r(_) and 

^Wu5H)=^W)+^)<y, 


which contradicts (5.17). Hence, S£ +) has measurable subsets of positive 
measure /x. Let n x be the smallest positive integer for which there is a 
measurable subset S[ +) of S^ +) such that /x(Sj +) ) > \ jn x . Since 

/W> - S< +) ) = riS<+>) - /*(£<+>) < /*(s< +) ) < 0, 

we can apply the same argument to S^ +) — S[ +) to infer the existence 
of a smallest positive integer n 2 such that S { 0 +) — S[ +) contains a measur¬ 
able set S£ +) with /x(S£ +) ) ^ l/w 2 . We can continue with this procedure, 
thus obtaining the sets S£ +) , S£ +) ,... . Then 

/* (0 = i 

\k =1 / fc=l &=1 k 

and consequently 

/* (^ + > - 0 ^ +> ) < 0 . 

v &=1 ' 

For arbitrary Sgj/ we have 
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In fact, if that were not so for some / — k 0 , then n ko would not fulfill 
the conditions of its definition since we could choose n kQ < n ko so that 

M (sn(sw-U5w)) >±- 

' ' k=l ’ ’ U k 0 


and replace Sj+ ] by 

( fco 1 v &o—1 

S<+> - (J £;<+>) C S<+> - 0 £!<+>. 

k=l k=l 

The sequence n x , n 2 diverges to infinity since 

I ^ < £ tis?) = - p (s‘+> - u s<+>), 

&=1 &=1 ' &=1 

and the measures of S { 0 +) and Sq +) — ^ =1 are nonpositive and there¬ 

fore finite. By combining this result with (5.18) we obtain 

i*(sn(sw-0^ , ))<o 

for all S estf. This would imply that Sq +) — is a set which 

belongs to «^ (-) , is of negative measure, and is disjoint from S { ~K Hence, 
we would have 

f* (S'-’ ^ (^ +) - 0 <S‘+>)) < = Y, 

which contradicts (5.17). Q.E.D. 

It is easy to see that if fi(S) is a signed measure, then the set functions 
/x (+) (S) = n(S n S {+) ) and /x ( “ ) (»S) = — n(S n S { ~ ] ) are measures. 
These measures are called the upper variation and the lower variation of 
/x, respectively. According to the above theorem, we can always write 

/x(S) = ^(+>(5) - /x ( ->(*S), Sej/. 

Using this result we arrive at the following conclusion. 

Theorem 5*8* Any countably additive set function /x(S) on a 
measure space (iF, j/) for which /x( 0) = 0 and which assumes values 
from C l is a complex measure (in the sense of Definition 5.5). 

In order to establish the above theorem we only have to note that 
Re fx(S) and Im /x(S) are signed measures and therefore they can be 
written as the difference of their respective upper and lower variations. 
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Exercises 

5.1. Show that in an n-dimensional Hilbert space the eigenvalue 
problem 

Af = Xf 

can be reformulated as a matrix eigenvalue problem 

n 

Y A^ k X k ^i > Aik I Ae-f), l/"X 

k=1 

in some orthonormal basis {e x e n }. 

5.2. Show that every linear Hermitian operator A acting on an 
ft-dimensional Hilbert space Jf 7 has a pure point spectrum containing 
at most n points. 

5.3. Prove that if A 1 , A 2 is a uniformly convergent sequence of 
operators on Jf 7 then for each e there is an N(e) such that || A m — A n || < e 
for all m, n > m- 

5.4. Let {e 1 , e 2 ,...} be an orthonormal basis in an infinite-dimen¬ 
sional Hilbert ,space Denote by | e k }<(e k | the projector (3.4) onto e k . 

Show that on account of Theorem 4.6(b) in Chapter I, 

n oo 

1 = s-lim £ | e k y<e k | = £ | e k y<e k \. 

n ~ >C ° k=l k=l 

5.5. Show that when s-lim A(t) exists, then 

lim || ^40O/|| = || s-lim ^4(*)/|| for all fe 

t^tQ 

5.6. Show that if ^4 = s-lim ^ 1q A(t) and B = s-lim t ^ to B(t), then 
s-lim t _> t A(t) B(t) exists and is equal to AB in the case that || ^4(£)|| < c 
for all t. 


5.7. Show that the intersection of all the subspaces in any given 
family of closed linear subspaces of a Hilbert space Jf 7 is a closed linear 
subspace of Jf 7 . 

5.8. Prove that any spectral measure E(B ), B e j/, is continuous from 
above and from below , i.e., if B = lim^oo B n , then 

E(B) = s-lim E(B n ), 

n->oo 

where B k D B 2 D • • • D B and B k C B 2 C • • • C B, respectively. 

5.9. Prove that || A* || = || A ||. 
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5* 10* Show that if A — u-lim^ A t and B — u-lim ^ ^ B t , then 
A B = u-lim (A t + B t ), 

t-»f 0 

AB = u-lim A t B t , 

t-*t 0 

A* = u-lim A* 

*-»*o * 

5*1 L Explain why any infinite orthonormal sequence e 1 , e 2 ,... in 
a Hilbert space is weakly convergent to 0, but it does not converge 
strongly to any vector. 


*6* The Spectral Theorem for Unitary 
and Self-Adjoint Operators 


6.1. Spectral Decomposition of a Unitary Operator 

We shall first state the spectral theorem for unitary operators, and 
later on proceed to prove it in a few stages. 

^Theorem 6*1 ♦ To every unitary operator U on a Hilbert space 
corresponds a unique spectral function E x having the properties that 
E 0 = 0, E 27J = 1, which is such that 


U = u-lim X e U <E Xk - E^), 


Vl < K < K’ 


6 = k™*,n ^ “ Vl). 


where the limit is taken over finer and finer partitions of [0, 2tt\ : 

(6.2) 0 = A 0 < Aj < ••• < A w = 2tt. 

Since uniform convergence implies weak convergence, we can write 


(6.3) </1 Ugy = e^d(f\E Ag y 

J [0.2tt] 

for any two vectors /, g e Jf?. 

It should be noted that since E Xl ^ E Xi for ^ A 2 , we have 

0 for A < 0 

1 for A > 2tt. 


(6.4) 
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Thus, the support of E(B) is within the set [0, 2 77 ], i.e., 

E(B) = 0 if B n [0, 2tt] = 0 . 

Consequently, the integration in (6.3) can be extended over U 1 . 

The formula (6.1) is customarily written in the symbolic form 

(6.5) U = f e fA dE x , 

J R 1 

and as such is known as the spectral decomposition of U. 

The basic idea of the construction of the spectral function E x consists 
in defining E x as the strong limit of monotonically decreasing sequences 
of polynomials in U and U~ x . We define the yet unfamiliar concept of a 
monotonic sequence of bounded operators so as to obtain a generalization 
of the already familiar case for projectors (see Theorem 3.10). 

6.2. Monotonic Sequences of Linear Operators 

Definition 6*1 ♦ If A is a symmetric operator in and c is a real 
number, we write A ^ c or A ^ c if, respectively, </ | Af) < c</1/> 
or </1 Af> > c(J |/> for all/ e 2 A . In that case A is said to be bounded 
from above or below , respectively; A is said to be positive if A ^ 0. 

If A and B are two symmetric operators on ^ and A — B ^ 0, then 
we write A ^ B. It is very easy to check that the above relation ^ is a 
partial-ordering relation (see Exercise 6.2). 

Theorem 6*2* If A ± y A 2 is a sequence of symmetric operators 
on which is monotonically increasing (i.e., A x ^ A 2 ^ •••) or mono¬ 
tonically decreasing (i.e., A x > A 2 ^ •••), and bounded, respectively, 
from above (i.e., A k ^ c for all k and some constant c) or from below 
(i.e., A k > c), then A x , A 2 ,... converges strongly to a limit A, which is a 
symmetric operator on Jf. 

Proof. Consider the case A x < A 2 < ••• . Since the operators A x , 
A 2 ,... are symmetric and defined everywhere on Jtf", they are bounded 
(see Theorem 2.10). 

We shall assume that all the operators A x , A 2 ,... are positive; the 
general case B x ^ B 2 ^ ••• can be easily reduced to this case by setting 
Ak = B k + || B x ||: 

</1 A k fy = </1 Bjy + </11| b x ||/> ^ </1 Bjy + || b x || <f\fy> o. 

To prove the strong convergence of A x f A 2 f } ..., note that A i — A$ ^ 0 
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if i >/. For any positive symmetric operator A we have the generalized 
Schwarz-Cauchy inequality (see Exercise 6.1): 

(6.6) \(g | Af)\ 2 < </| AfXg | Ag). 

By inserting above A = A i — Aj and g = (A i — Aj)f, we get 

(6.7) {{A i -A j )f\{A i -A j )f) 2 

< </1 (A t ~ A,)fX(Ai - A,)f | ( A , - ^)*/>. 

Let c be an upper bound of the sequence A 1 ^ A 2 ^ ••• , so that 
</1 ^/> < c ||/|| 2 for all i = 1, 2,... . From the inequality 

if\(A i -A j )fy ^j\Aj) ll/ll 2 

we get by using the relation (6.6) with A — A i — Aj , 

l<* I (A- - A-)/)l 2 < * a ll/ll 2 II g II 2 , f,ge*- 

The above inequality implies that || A i — A 3 - 1| ^ c. Using this result in 

(6.7) we obtain 

(6.8) || (A, - i4,)/ll 4 < < 3 ll/ll 2 K/l AJ> - </1 AJ) |. 

Since 

</l A/> <</l^ 2 /> < - << ll/ll 2 , 

the above sequence of numbers is convergent, so that the term 
|</ | AJ) — J | AJ )| can be made arbitrarily small for sufficiently 
large i and /. In the light of this remark, (6.8) yields the strong 
convergence of AJ , AJ ,... . 

The linearity and Hermiticity of A , where 

Af = s-lim AJ 

n-* oo 

is easily derivable from the corresponding properties of each of the 
operators A x , A 2 ,... (see Exercise 6.3). Q.E.D. 

6.3. Construction of Spectral Families 
for Unitary Operators 

Assign to every trigonometric polynomial 

+n 

p( eiQ3 ) = Z c ^ i1c<p 

k*~-n 


(6.9) 
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the operator p(U), where 

+n 

(6.10) p(U)= £ 

k=—n 

Then we can state the following lemma. 

Lemma 6 A* The mapping 

( 6 . 11 ) Pi^^PiU) 

of the family of all trigonometric polynomials ( 6 . 9 ) into the family of 
linear operators on has the following properties: 

(6.12) + *iPi(J") -► a iPi( u ) + «2 Pz(U) 9 

(6.13) h{*)P^)^h(U)PAU)> 

(6.14) (PV"))*^(P(U))*- 

Moreover, if p(e icp ) ^ 0 for all 9 e R 1 , then p(U) is a positive operator. 

Proof. The verification of the relations (6.12)-(6.14) is straightfor¬ 
ward algebra and is left to the reader (Exercise 6.4). 

In order to prove that p(U) ^ 0 if p(e i(p ) ^ 0, we use a lemma by 
Fejer and Riesz (see Exercise 6.5) which states that if p(e i<p ) ^ 0, then 
there is another trigonometric polynomial q(e iw ) such that 

p(e iQ) ) = | q{e i(p )| 2 = [q(e iQ) )]* q(e iQ) ). 


Using (6.13) we get 

<f\p(U)fy = (f\( q (U))* q (U)fy 

= <q(U)f\q(U)fy^o, 

which shows that p(U) ^ 0. Q.E.D. 

We can extend the mapping (6.11) to the family of all functions 
w(cp) y 9 g R 1 , which are the limits 

(6.15) w(cp) = lim p n (e iQ) ) 

n ->°0 

of monotonically decreasing sequences 

(6.16) Pi(e iq> ) > P<f? icp ) > >0 

of positive trigonometric polynomials. This result constitutes the 
following lemma. 
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Lemma 6*2* To each function w(cp) corresponds a symmetric 
positive operator A w on 

A w = s-lim p n (U), 

n->co 

which is independent of the chosen monotonic sequence (6.16). The 
mapping 

(6.17) w(<p)-^A w 
is such that 

(6.18) w x ( 99 ) w 2 (<p) -> A WiWz = A Wi A W2 . 

Proof. Due to the last statement in Lemma 6.1, we have for any 
sequence (6.16) that 

Pi(U)>p 2 (U)>->0- 

Hence, by Theorem 6.2, the above sequence has a strong limit A, which 
is a symmetric operator on Moreover, A ^ 0, 

</l4/> = lim <f\p n (U)f>>0. 

71-4 00 

In order to prove that the limit A is independent of the chosen sequence 
(6.16), take two such monotonically decreasing sequences {pn\ eiq> )} and 
{/ 4 2 ) (e i<p )}, for which 

lim P { }\e iq> ) = lim p (2) (e iQ} ). 

The above assumption implies (see Exercise 6 . 6 ) that for every integer n 
there is some pkf(e i(p ) such that 

p™{**) < Pn\e i(P ) + lIn. 

By using again the last statement of Lemma 6.1 we infer that 

PlJU) < p™(U) + 1 In, 

which implies that 

s-lim^W) < s-lim p™(U). 

By reversing in the above argument the roles of p ( n{ ei(I3 ) and Pn \ ei(P ) we 
can deduce that 

< s-lim/>«>([/), 

which implies (see Exercise 6.2) that the above two limits are equal. 
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Finally, the property (6.18) of the mapping (6.17) follows from (6.13) 
by using the result of Exercise 5.6 on strong limits. Q.E.D. 

Consider the following family of periodic functions IJ A ((p) defined by 

77 A (<p) = 0 for A < 0, 

/7 A (<p) = 1 for A > 2 t r, 

i qv n , * _ ( 1 when ItiTT < (p < 2ti7T + A 

} A(p) ~ \ 0 when 2htt + A < <p < 2{n + 1) tt, 

0 < A < 2tt, ti = 0, zblj dz2,... . 

We shall first show that the operators E A assigned to the above functions 
by the mapping (6.17) are projectors, and later on we shall prove that 
these projectors constitute the spectral family of Theorem 6.1. 

Lemma 6*3♦ The functions (6.19) belong to the family . The 
operator E A assigned to /7 A (<p) by the mapping (6.17) is a projector in Jf. 

Proof. It is easy to construct* for any given function n A (cp) a mono- 
tonically decreasing sequence 

u i(<p) > u 2(<p) > ••• 

of once continuously differentiable and periodic functions u n (<p) 
u nW) — u n{<P + 2&7t), k — dil, ±2,..., 
converging to n A (cp). Choose a sequence ^ , § 2 ,...—► 0, where 
(6.20) 0 < S n < \ mf [min{w w (<p) - u^y), u n+1 (cp) - u n (cp)}]. 

According to Chapter II, Theorem 7.3, there can be found for each 
u n (cp) satisfying the above imposed conditions a trigonometric polynomial 
p n (e iq> ) for which 

I u n (<p) — p n {e icp )\ < h n , <p e 1 R 1 . 

Due to the conditions (6.20), we have that ^ e l<p ) ^ ••*, and 

obviously 

lira p n (e tv ) = lim u n (<p) = II x (<p). 


This proves that PI^cp) e *p- 

* See the construction of the functions h n (x) in the course of the proof of Theorem 5.6 
in Chapter II. 
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If 2? A denotes the symmetric operator in 34? assigned to 77 A (9), by (6.18) 
we shall obtain E A 2 — E A , due to the fact that 

nm = njjp). 

Consequently, E A is a projector. Q.E.D. 

6.4. Uniqueness of the Spectral Family 
of a Unitary Operator 

We have to prove now that the projectors 2? A defined in Lemma 6.3 
constitute a spectral family. 

From the obvious relation 

njjp) n x {<p) = n x ( v ), a < a'. 


and (6.17) we get 

(6.21) E a E a >=E a , A<A'. 

Thus, in order to establish that E A , A e R 1 , is a spectral family, we are 
left with the task of demonstrating that 

(6.22) = #a+ o = s-lim Ey . 

First, we note (as can be easily and explicitly proven in the same fashion 
as in Lemma 6.3) that a monotonically decreasing sequence {p n (e iQ) )} of 
trigonometric polynomials can be found which is such that 

lim p n (e i(p ) = II x (e iQ3 ), 

7l~*00 

and in addition 

Pn(J") > n x+i/n(<p), n = 1 , 2 ,... . 


Consequently 


Pn(U) ^ ^A+l/w j 


and since E A < E A+l/n , 

< s-lim E M/n = s-lim p n (U) = E x , 


i.e., 


s-lim E A+1/n = E a . 


As any sequence , A 2 ,... A + 0 can be majorized by a sequence 
containing only the numbers A + 1/w, n — 1, 2,..., and converging to A, 
we see that (6,22) is true. 
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To establish now that E A , A e R 1 , of Lemma 6.3 is the desired spectral 
family, note that the definition (6.19) of I7 A (<p) yields immediately that 
(6.4) is satisfied. Thus, we have to verify that (6.1) is true. 

If we keep the notation of (6.1) and (6.2), we see that for X k _ 1 ^ 9 ^ A k 

e i« _ £ e ^{n h { v ) - n h j<p)) k 1 4* — & I ^ I 9 - K' I < «. 

1=1 1 

Since the above holds for k = 1,..., n, we have 
(6.23) 0 < k - t - #**»)]* 

L 1C=1 


X k - t e iA Kn4<p) - 

L Jc =1 

for any 9 e [0, 2tt] . From the last statement in Lemma 6.1 about the 
mapping (6.17), alias (6.11), it can be easily derived that (6.23) implies 

0 <[U-t e^(E h - - t e^{E Xk - 2^j] < «». 

L k=l k =1 

Thus, we have 


for all / e and consequently 

I n .. 

k =1 11 


which proves ( 6 . 1 ). 

To establish the uniqueness of the spectral decomposition of U , 
assume that 2?/ is a spectral function satisfying (6.1) and (6.4). Because 
of the pairwise orthogonality of E[ k — E f x for different values of k> 
we easily derive (using the results of Exercise 5.10) that for any nonnega¬ 
tive integer m 



E u 


= u-lim 

71-400 

e -*0 



J X k 


(Ek - E; k j 


m 

= U m , 
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e-*0 1 

= u-lim [ £ ~ ^l" = (^*) m = ^ m , 

”eio k=1 -* 

i.e., in symbolic notation, 

f e im(p dEJ = U m , m = 0, ±1,... . 

^ R 1 

Due to linearity of the above integration procedure, we easily establish 
that for all trigonometric polynomials p{e iq> ) we have 

f P(e i »)d<f\E Q) 'gy = <f\p(U)gy 

J R 1 

for any /, g e Jf. With the help of Theorem 3.10, Chapter II, which 
remains valid for complex measures, we can generalize the above formula 
to any function zv(cp) e 

f w(<p) d(f | EJg) = </1 A w g >, 

J R 1 

by approximating w(cp) by a sequence (6.16) and then going to the limit. 
Thus, in particular, for zv(cp) — I7 x ((p) we get 

f n A ( 9 )d(f\E Q} 'gy = (f\E xg y, 

J R 1 

i.e., after carrying out the above integration, 

Since E 0 ' = 0, and the above identity is true for any /, g e we get 
= E a for all A e R 1 . This establishes the uniqueness of the spectral 
decomposition of U and completes the proof of Theorem 6.1. 

6.5. Spectral Decomposition of a Self-Adjoint Operator 

The spectral theorem for self-adjoint operators on a Hilbert space 
can now be stated and proved. 



250 


III* Theory of Linear Operators in Hilbert Spaces 


Theorem 63* (The spectral theorem). To each self-adjoint operator 

A in a Hilbert space M" corresponds a unique spectral function E x , 
A e (R 1 , such that the domain 2 A of A consists of all the vectors / for 
which 

(6.24) f < +oo, 

J ur 

and for any / e 2 A and g e 

(6.25) (g\Afy = f Xd{g\Ejy. 

J R 1 


Proof. We shall derive the existence and uniqueness of the spectral 
function of A from the existence and uniqueness of the spectral function 
of the Cayley transform V of A. 

Inasmuch as A is self-adjoint, its Cayley transform V is a unitary 
operator (see Theorem 4.9), which, therefore, has a spectral decomposi¬ 
tion 

V = f J"dE 0 . 

J [ 0 , 27 t ] 

According to Theorem 4.8, if / £ 3) A we have 

(6.26) Af = f(l + V)h, h = (1 - F)- 1 /. 

Consequently, for any we get by applying in the last step of the 

following derivation the result of Exercise 6.7: 


<* i a r> 


= Kg I *> + Kg I Vh> 

= * f d <g I K h > + 1 f eiw d <g I £w h > 

J [0,2tt] J [0,2t t] 

= *•/ 

J [0,2tt] 1 e 




) d Pg,h(9)> 


where the complex measure [x gth (B), B e is defined as 


= f (1 -e™)d(g\£ <t h). 
J B 
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On the other hand, by making use of the fact that / = (1 — V)h and 
E{B)V = VE(B) (see Exercise 6 . 8 ), we easily derive 

( 6 . 27 ) ig | E(B)fy = (g | E(B) hy-(g | VE(B) h) 

= J r , d( S I h> - f d(g I ^£(5) K} 

J L0.2t r] J [0,2 tt] 

= f (l -«*)<*<* I £.*> = n gih (B), 

J B 

by noting that by (5.16) and Theorem 5.3 

EJ{B) = E(B n [0, «p]), £ C [0, 2ir]. 

Thus, we obtain 

<g | Af> = - f cot i«p 1 £*/> = f A dig I £„/>, 

^ I0.27rj J K 1 

where in the last step we had introduced the new variable 

A = —cot |<p, 99 g ( 0 , 2 ?r), 

and in terms of it the new spectral function 

— E(_2 arccot a) ~ ^<p • 

For/e 3) A we get from (6.27) 

if I £(B)fy = f (I — «*•) dif I Ejiy 

J B 

= f (l - e^) dih | Ejy* 

J B 

= J (1 -e<«)df*U<P) 

J B 

= f I 1 - «*• I 2 dih | Ejiy 

J B 

= 4 f sin 2 | 

J B 

and consequently (see Exercise 6 . 7 ) 

/ A2 dif I Ejy = f cot 2 i<p d(f I £;/> 

R l J t«,2n] 

= 4 f cot 2 fp sin 2 dih \ Ejiy 

J [0.2tt] 

= 4 f cos 2 \<p dQi | Ejiy 

J [0.2tt] 

< 4 f d(h | Eji) — 4<A | /*> < +oo. 

J f0,27r] 
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The above consideration proves the existence of a spectral function E x 
satisfying (6.24) and (6.25) for all/e 2 A . We now want to show that there 
is only one such spectral function. 

Suppose £/, ^ G H^ 1 , is a spectral function satisfying (6.24) and (6.25) 
for all/e 2 A . It is easy to see (see also Exercise 6.9) that the operator A' 
defined on the set 

(6.28) j/: J A 2 d(f | Ejy < +a o, fe 3T j 

as the operator for which A'f satisfies 

<g\A'f> = f Xd(g\Ejy 
J u 1 

for all# e is linear. By the original requirement, we have A' D A and 
consequently A'* C A* = A (see Exercise 2.9). On the other hand, A' 
is symmetric, since for g e 2 A * , 

<g I A’fy = f A d(E A 'g |/> 

J u 1 

= f Xd <f\ E *g>* = 

J K 1 

Thus we also have A' C A’* C A, i.e., A = A'. 

Now we can reverse the earlier procedure of going over from the spec¬ 
tral function of V to the spectral function of A, and derive that 

— ^--COt£<p 

is the spectral function of the Cayley transform of A. Due to the unique¬ 
ness of (see Theorem 6.1), it follows that = Ej, i.e., E x = E A '. 

The above argument establishes at the same time that (6.28) coincides 
with the domain of definition 2 A of A. Q.E.D. 

The spectral function E A and the corresponding spectral measure E(B) 
satisfying the conditions of Theorem 6.3 are said to belong to the operator 
A . In that case it is customary to write symbolically 

(6.29) A = f XdE A . 

J u 1 

The integral on the right-hand side is usually referred to as the spectral 
decomposition of A. 

The integration in (6.25) extends effectively only over the spectrum 
S A of the operator A , where the spectrum can be defined as follows. 
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Definition 6*2* A point A e R 1 belongs to the spectrum of the 
self-adjoint operator A with the spectral measure E(B) if E(I) is nonzero 
for every open interval 7 containing A. 

If A ^ S A , then E(I 0 ) — 0 for some open interval 7 0 containing A. 
Hence, all the points in 7 0 do not belong to the spectrum. Thus, all the 
points not belonging to the spectrum constitute an open set, and there¬ 
fore the spectrum is always a closed set. 

Definition 6 3* A point A e S A is said to belong to the point spectrum 
Sp of A if the spectral measure of A is different from zero at A, i.e., 
2?({A}) =£ 0. The set — S A — Sp is called the continuous spectrum of A. 

One of the most important properties of the spectrum of a self- 
adjoint operator A is that it is the support of the spectral measure of A , 
i.e., 

(6.30) E A (S A ) = 1. 

In order to establish the above relation, let us write 

S A =C\S t , S.DS.D-, 

Jc =1 

where S k is defined as the union of all intervals [n/2 k , (n -f l)/2 fc ] which 
contain at least one point of the spectrum of A. Now, if a closed interval 
7 does not contain any point of the spectrum then E A (I) = 0. In fact, if 
that were not so, we could split the interval in two halves and then at 
least on one of the halves the spectral measure would be different from 
zero. By continuing the process we would obtain a monotonic sequence 
of intervals which shrink to a point in 7, and that point would obviously 
belong to the spectrum. Using this result, we can immediately infer that 
E A (S A ) = 1 and consequently, due to the continuity from above of 
spectral measures (see Exercise 5.8), we conclude that 

^(S- 4 ) == s-lim E A ( S k ) = 1, 

00 

i.e., (6.30) is true. 

6.6. The Spectral Theorem 

for Bounded Self-Adjoint Operators 

We conclude this section by showing that for bounded self-adjoint 
operators the integral in (6.29) can be proven to converge uniformly, 
as was the case with the similar relation (6.1) for unitary operators. To 
prove this statement we need the following lemma. 
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Lemma 6.4. If A is a bounded self-adjoint operator on 34?, its 
spectrum S A lies within the interval [—1| ^4 ||, +11 ^4 11], and at least one 
of the two numbers — 1 | A || and +|| A || belongs to the spectrum of A. 

Proof. If A e S A and if E{B ), B e 3BA, is the spectral measure of A , 
then E(I e ) + 0, / e = (A — e, A + e) for any e > 0. Hence, there is 
a nonzero f e e 34? for which E(I e )f € = f € . Consequently (see Exer¬ 
cise 6 . 10 ) 

M/J 2 = f Vd\\EJ c f^{\\\-ey\\f'\\\ 0 < € < I A I, 

so that | | A | — e | ^ || A || for any of the above e > 0, i.e., | A | ^ 
|| A ||. This establishes that S A C [— 1 | A ||, || -\-A ||]. 

If a = sup{| A |: A e S A } y then by (6.30) and Exercise 6.10 

II Af\\ 2 = f A 2 d || E h f\\ 2 ^ a 2 ll/ll 2 , fe@ A = 34?, 

^ [—a,+a] 

i.e., || A || < oc. On the other hand, a < || A || since S A C [— 1 | A ||, +|| A ||]. 
Hence oc = || A ||, and since S A is closed we must have that either 
oc e S A or —oc e S A , or both. Q.E.D. 

We are now ready to prove Theorem 6.4. 

Theorem 6.4. Let — 1 | A || = A 0 < A x < ••• < A n = +|| A || be a 
partition of the interval [— 1 | A ||, +|| A ||] and A/,..., A n ' any points for 
which A fc-1 < A fc ' < A fc , k = 1,..., n. Then in the limit of finer and finer 
partitions 

n 

(6.31) A = u-lim £ K'( E * H ~ E \ k J, I K — 4-i I, 

oj^O fc=1 

where E A is the spectral function of A. 

We shall only sketch the proof of this theorem. 

By following the procedure used in proving Lemma 6.1, it can be easily 
established that the mapping p(x) -> p(A) of the set of polynomials in 
xeR 1 into the set of linear operators in 34? has the following properties: 

Pl( X ) + P 2 ( X ) “ > Pl(A) + p2(A), 
pi( x ) p 2 ( x )Pi(A) p 2 (A ), 
p*{x)-^[p{A)}\ 
p(A) ^ 0 if p(x) > 0. 
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Then, by using the method employed in proving Lemma 6.2 one can 
show that the characteristic function xi ( x ) of the closed interval 
h = [—1| A ||, A] is on the interval [—1| \ ||, +|| A ||] the limit of a 
monotonic sequence pi(x) ^ Pz(x) ^ * * * of positive polynomials. 
Combining these two results one can easily show that 

is a spectral function with = 1. 

From the construction of this spectral function it follows that, given 
any partition —1| A || = A 0 < X 1 < ••• < A n = -f|| A || of the interval 
[—11 A ||, +|| A ||], we have 

^-(E^ — E h i ) + — 4 _,)' 

A summation in k = 1,..., n obviously yields 

n n 

(6.32) £ K-i{E\ k ~ < A < £ K(E h - E, h J. 

k=1 k=l 

Furthermore, from the inequality 

n n 

k=1 k=1 

n , n 

= Z (A, - K- x ){E Xk - E Xk J < CO £ (E h - E h J = w 

k=1 k=1 

we see that both sums in (6.32) have to converge to A in the limit co —► 0. 
Hence, (6.31) is established. 

From (6.31) we easily deduce that 

<f\Ag> = j\d(f\E,g} 

for all/, g e On account of the uniqueness of the spectral decomposi¬ 
tion of A y we conclude that E x is the spectral function of A. This com¬ 
pletes the proof of the theorem. 

Exercises 

6J. Prove the generalized Schwarz-Cauchy inequality 

l</1 Ag)\ 2 < </1 AfXg I Ag\ 

which is valid for any positive symmetric operator A defined on 
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6*2* Show that if A and B are two symmetric operators on , then 
A ^ B and B ^ A implies A = B. 

6.3. Show that if A 1 , A 2 ,... is a sequence of self-adjoint operators 
on and s-lim n ^ 00 A n f = /' for each/e Jf 7 , then A(f) = /' is also a 
self-adjoint operator. 

6*4* Verify explicitly that Lemma 6.1 is true. 

6*5* Prove the lemma of Fejer and Riesz , which states the following: 
if p(e iqp ) is a positive trigonometric polynomial, p(e iv ) ^ 0, there is a 
trigonometric polynomial such that p(e iv> ) = | q(e iv ) | 2 . 

6*6* Assume that u 2 (x),...> and ^(a?), v 2 (x)>...> are two monoto- 

nically decreasing sequences of continuous functions on the finite 
interval /, for which lim n ^ 00 u n (x) = lim n ^ 00 v n (x). Prove that for each 
integer n there is some other integer k n such that 

“* n (*) < + 1 /*• 

6.7. Show that if f(£)g(£) and#(£) are /x-integrable complex functions 
on Ry and /x 1 (S') denotes the complex measure 

ih(S)= f g(Qdr(Q, SCR , 

J 5 

then/(£) is ^ integrable on R and 

f /(o^(o= f mgMMO- 

Jr j r 

6*8* Show that if U is a unitary operator and E(B) y B e SS 1 , is its 
spectral measure, then UE(B) — E(B)U. 

6.9. Show that if E x , A e R 1 , is a spectral function, the set 

|/: j^d<f\EJ}< + o>j 

is a linear subspace of . (Hint-. \ || E x (f + £)|| 2 < || £ A /|| 2 + II E x g || 2 ). 

6.10. Prove that if A = A*, then for /e Si A 

II Af\\ 2 = f \*d(f\Ejy. 

J U 1 


References for Further Study 

Consult any text on functional analysis, such as Akhiezer and Glazman [1961], Kato 
[1966], Riesz and Sz. Nagy [1955], Schechter [1971], Yosida [1974]. 



CHAPTER IV 


The Axiomatic Structure of Quantum Mechanics 


In the quantum mechanical problems encountered thus far we have 
had to limit ourselves basically to the special version of quantum mechan¬ 
ics known as wave mechanics, in which we could characterize the state 
as a wave function. The knowledge of the theory of linear operators in 
Hilbert spaces acquired in Chapter III will enable us to study the struc¬ 
ture of quantum mechanics from an abstract and very general point of 
view, in which the formalism of wave mechanics becomes a particular 
realization of the general formalism. 

We start this chapter with a section on the theory of measurement in 
quantum mechanics, which investigates the relation of the basic theoret¬ 
ical constructs to experimental material. We proceed then to explore 
the basic mathematical concepts of quantum mechanics, formulating in 
the process the fundamental axioms which these constructs have to obey. 
This entire exposition will lead in a natural way to specialized versions 
of quantum mechanics, such as the already familiar case of wave 
mechanics. 


I* Basic Concepts in the Quantum Theory 
of Measurement 

1.1. Observables and States in Quantum Mechanics 

The theory of linear operators in Hilbert space developed in 
Chapter III has equipped us with all the necessary tools for a very 
general formulation of quantum mechanics. 

As we saw in the Introduction, the basic categories of primitive con¬ 
cepts of quantum mechanics are, as in the case of classical mechanics, 
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those of state and observable. In order to be able to give a mathematical 
description of these objects, we associate in classical mechanics with any 
given system a phase space; e.g., with an n-particle system we can 
associate a 6n-dimensional vector space. For the same reason, we associate 
a Hilbert space with any given system which we intend to describe 
quantum mechanically. For instance, as we have seen in §5 of Chapter II, 
if the system consists of n particles, which are of different kinds, without 
spin and moving in three dimensions, then in wave mechanics (which is a 
special version of quantum mechanics) we associate with that system 
the Hilbert space L 2 (lR 3rl ). 

Let us assume now that we are dealing with a particular quantum 
mechanical problem in which a certain system has been specified 
(e.g., a hydrogen atom) and with which a certain Hilbert space is 
associated [e.g.,L 2 ((R 6 )]. It is then postulated that to each observable 
corresponds in the formalism a unique self-adjoint operator acting in ^ 
—though the converse is by no means required, i.e., there might be self- 
adjoint operators in ^ which do not correspond* to any observables. 

As an example of operators corresponding to observables related to a 
two-particle system (e.g., the hydrogen atom) described wave mechan¬ 
ically in L 2 (1R 6 ), consider the self-adjoint operators Qi X \ Qi V \ Q { /\ 
Pf\ P\ y \ Pf y i = 1, 2, defined as follows. 

Q {x) is the self-adjoint operator [see (4.19) of Chapter III] with a 
domain of definition 

(1.1) S>! x) = U: </. eL 2 (R«), f | x t 4>{x )| 2 d«x < +ooj, 
where 

(1.2) x = (x 1 , y x , * 1 , * 2 , J 2 , * 2 ), = dx x ••• dz 2 , 

which acts on a i/j e 2)^ in the following way: 

(1.3) (& W 0(*) = 

An analogous definition holds for Q[ y) and Q\ z) . 

P\ x) is the self-adjoint operator [see (4.20) of Chapter III] with a 
domain of definition 

(1.4) = j^: 4> sL*(R% J I p? f(p)\* d«p < +ooJ, 

* In the early days of quantum mechanics there was a tendency to assume that each 
self-adjoint operator represents an observable (see von Neumann [1955]). However, the 
existence of so-called superselection rules (which will be considered later in this chapter) 
indicated for the first time that this requirement lacks experimental support. 
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where $(p) is the Fourier-Plancherel transform of ip(x)> and 

(1.5) p = (p[ x \ p[ y \p[ z \ p {x \ p (y \p (z) \ d*p = dp™ ••• dp£'\ 

P]* ) acts on any i p e ^ (IR 6 ) C ^^((R 6 ) according to the formula 

( 1 . 6 ) (PPfflx) = 

Pf V) and P\ z) are defined analogously. The operators Q ( ?\ Q^\ Qi Z) 
correspond then to observables which from the experimental point of 
view represent measurements of the x, y , and z coordinate, respectively, 
of the ith particle in a Cartesian inertial reference frame. Similarly, 
Pi X \ Pi V) y P { i Z) correspond to the measurements of the x t y , and z com¬ 
ponents, respectively, of the momentum of the ith particle with respect 
to the same frame.* 

As another example of an observable for the same system, consider 
the operator H' with domain of definition 2 R , — ^(IR 6 ), which maps 
a i/j e Q) H ’ into 

0.7) (»■«(*)+ f'M) m 

For a very wide class of suitable potentials V(x) f the above operator H' is 
essentially self-adjoint, i.e., its adjoint (//')* 3 H' is a self-adjoint 
operator. Its self-adjoint extension (//')*, customarily denoted by H , 
is called the Hamiltonian of the system, and is an observable correspond¬ 
ing to measurements of the total energy of the system. 

In general, for any given system, there are correspondence rules 
specifying which self-adjoint operator corresponds to each particular 
observable related to the system. These rules are part of the set of corre¬ 
spondence rules relating the formalism to the experimental framework. 
We shall meet some of the most important of these rules, which are 
concerned with the most fundamental observables, such as energy, 
position, momentum, angular momentum, spin, and charge, as we 
proceed in our study of quantum mechanics. 

Let us turn now to the concept of state. As a generalization of the 
consideration in §5 of Chapter I and §5 of Chapter II, it is postulated, 
in general, that a state + is specified by a vector-valued function W{t), 


# See Exercises 1.5 and 1.6 for the spectral measures of these operators. 
f We are adopting here the Schroedinger picture of quantum mechanics. In §3 we shall 
encounter other physically equivalent pictures. 
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m e of the time parameter t\ W(t) is never zero, i.e., || ^(^H > 0 
for all t e U 1 . We have, again, as in the aforementioned sections, that if 
¥*( t ) describes a state, then cW{t)y for any nonzero constant c> describes 
the same state. The reason for this assumption lies in the nature of 
Born’s correspondence rule, which is the fundamental principle on 
which the accepted interpretation of the quantum mechanical formalism 
lies. We shall now focus our attention on this rule. 

1.2. The Concept of Compatible Observables 

Let A be a self-adjoint operator corresponding to an observable 
(also referred to as A) f and denote by E(B)> B e the spectral measure 
of A 



Consider a system which is known to be at time t in a state described by 
the normalized vector ¥ / (^). In keeping with a suggestion first forwarded 
by Born [1926a, b] for the case when A is a position observable [see 
(5.2) of Chapter II], we interpret the measure 

(1.8) Pi it) (B) = <T(t) | E(B) ¥*(*)>, B e 

as a probability measure in the following way: a measurement to 
determine the value of A, carried out on the system at the time t, has 
the probability P^ {t) (B) to have as outcome a value within the set B. 

A generalization of this interpretation can be applied to any finite set 
of compatible observables. This necessitates an understanding of the 
concept of compatibility, which is empirically rooted and to which we 
give the following meaning. 

Definition 1 A ♦ The observables A 1 ,..., A n related to a certain system 
are compatible if, in principle, arbitrarily accurate measurements of the 
simultaneous values of these observables can be carried out; i.e., if 
for any prepared ole S from the set SCR 71 of feasible (cf. Definition 
5.2) simultaneous values of A 1 ,...,A n , and for any open confidence* 
interval I containing a, the confidence probability p of actual readings 
falling within I can be brought arbitrarily close to one by a suitable 
design of apparatus for the simultaneous measurement of A 1 A n . 

Denote the self-adjoint operators corresponding to the observables 
A 1 A n also by A x A n . It is a basic feature of all quantum mechan- 


* See Sections 2.49 and 5.21 of Dietrich [1973]. 
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ical theories that if the n observables A 1 A n are compatible, then 
the corresponding operators must commute with each other. Note, 
however, that in general A x A n can be unbounded operators which 
are not defined on the entire Hilbert space and, therefore, for which 
we do not have an a priori guarantee that the commutator 

[Ai , A k ] = A t A k — A k Ai 

is defined on a domain which is dense in Jf. We bypass this difficulty by 
introducing the following concept of commutativity. 

Definition 1*2* Two self-adjoint operators A 1 and A 2 are said to 
commute if their respective spectral measures E {1) (B) and E (2) (B) 
commute, i.e., 

(1.9) [E' 1 ^), E<*\B 2 )\ = 0 
for all B 1 , B 2 e 2 1 . 

In the case when A x and A 2 are bounded self-adjoint operators with 
2 A = 2 A = (1.9) is a sufficient and necessary condition (see 

Exercises 1.1 and 1.2) for [A 1 , A 2 ] = 0. 

1.3. Born’s Correspondence Rule 

for Determinative Measurements 

The above considerations show that when A x ,..., A n are self-adjoint 
operators corresponding to compatible observables and E (1) (B),..., E (n) (B) 
are their respective spectral measures, then E (1) (.B 1 ),..., E (n) (B n ) commute, 
and consequently the operator 

(1.10) E(B 1 x - X B n ) = £<!>(£,) - E^(B n ) 

is a projector (see Chapter III, Theorem 3.6). If ¥*( t) is a normalized 
vector, || ^(^H = 1, representing the state of a system at the time t , 
then the generalization 

(1.11) x - x B n ) = <W(t) | E a) (B 1 ) - EM {B n )V(t)> 

of (1.8) is interpreted as being the probability that a measurement 
designed to determine the simultaneous values ,..., A n of A 1 ,..., A n , 
respectively, will yield a result 

A x g B x ,..., A n g B n , B x ,..., B n g 


when carried out on the system at time t . 
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We can generalize the above statement further by noting that the set 
function (1.11), defined by (1.11) only on the Borel sets of U n which 
belong to (& 1 ) 71 = SB 1 X ••• X <^ 1 , can be extended to a measure 
Py\’ t y ,An (B) y B e on the family S8 n of all the Borel sets in U n . 
Moreover, by Theorem 5.5 in Chapter III, the projector-valued set 
function (1.10) can be extended to a spectral measure E Al3 ” U3A *(B) 
defined for all B e ^ n , and such that 

p^ t r An ( B ) = <m i e Ai . \b) m>■ 

This suggests the following interpretation, which applies to measure¬ 
ments which are designed to determine (i determinative measurements; 
see the Introduction) the simultaneous values of any finite number of 
given compatible* observables (see Section 8.9 for incompatible ones). 

Definition 13 (Born's correspondence rule for determinative measure¬ 
ments). Let A 1 ,..., A n be n commuting self-adjoint operators acting 
in Jf, with spectral measures E Al (B) y ... y E An (B ), respectively, and 
representing n compatible observables. Denote by 

(1.12) E Al . An (B), B e 

the unique spectral measure which satisfies the equalities 

(1.13) E A ' . An (B 1 x ••• X B n ) = E^BJ • E An (B n ) 

for all choices of B x ,..., B n e & 1 . If the vector-valued function t) 

m E , —00 < t <C “j” 00, 

represents an already prepared state of a given system, then the measure 

(U4) pfor^B) = 11 m\\-\m 1 e Ai . m) m> 

is a probability measure: for each B e 3% n the number Py\’ t y iA n(B) is 
the theoretically predicted probability that a measurement carried out 
at time t to determine the simultaneous values Aj ,..., A n of A 1 ,..., A n , 
respectively, will yield a result within B , i.e., (A x ,..., A n ) e B at t. 


* The first attempts of treating incompatible observables were made in the 1960s 
(She and Hefner [1966], Prugovecki [1966. 1967], Park and Margenau [1968]). A con¬ 
sistent extrapolation of Born’s correspondence rule turns out to be indeed feasible 
(Prugovecki [1976], Ali and PrugoveCki [1977], Schroeck [1978]), and in fact quite essential 
for relativistic quantum mechanics (PrugoveCki [1978d, 1981a-c]). 
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According to the above interpretation 

(1.15) <A) w(t > = <V(i) I AV(i)> = f + “ A dPi (t) ( A), || !F(*)II = 1, 

•> —oo 

represents the mean value (also called expectation value) of the observable 
A for the system in the state *F( t) at time t. If A is an unbounded 
operator, then (1.15) is not necessarily defined for all vectors t ) e 

since the integral on the right-hand side might diverge for some such 
vectors. However, if ¥*( t ) e 3f A , then by the spectral theorem 

(1.16) f + °° A 2 dPy( t )(X) < +oo, 

J —oo 

and since | A | ^ A 2 for | A | ^ 1, the integral in (1.15) converges and 
(A)xf,(i) exists. 

As an example of compatible observables, take the six position observ¬ 
able i = 1, 2, defined by (1.1) and (1.3). The spectral measure 

(1.12) is (see Exercises 1.4 and 1.5) 

(E Q -°%Bmx, t) = XB (x) Kx, t), 

and the probability of discovering the first particle at r 1 and the second 
particle at r 2 , where x = (r x , r 2 ) e B , is given according to (1.14) by 

(1-17) P^(£) = / \>Kx,t)\*d*x. 

J B 

This agrees with (5.2) of Chapter II. 

The six operators P\ x \ ..., i = 1,2, representing momentum and 
defined by (1.4) and (1.6), also correspond to compatible observables. 
In this case we have (see also Exercise 1.6) 

(1.18) t) = (U-\ XB ^])(x y t). 

Thus, in terms of the Fourier-Plancherel transform $(p) of i/j(o c), 

(1.19) f I $(py Ol 2 ^ 6 />* 

J B 

'The operators Q {x) and P\ x) do not commute. This feature corresponds 
to the experimental fact that the position and momentum are incompati¬ 
ble observables. 
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1.4. Born’s Correspondence Rule 
for Preparatory Measurements 

The above correspondence rule (Definition 1.3) and discussion of the 
process of measurement is related to the subject of determinative 
measurement. In this case, we have a system about which it is already 
known to be in a state described by the vector-valued function x F(t). 
Thus, we have a situation in which, on the one hand, the correspondence 
rule in Definition 1.3 predicts the probability that different observables 
of the system have certain values at any given time t, while, on the other 
hand, a procedure of measurement (such as those described in the 
Introduction for position and momentum) enable us to experimentally 
determine the values of these same observables. These considerations 
do not tell us, however, how to make sure that a system is indeed in a 
state X F(t ). This last question involves preparatory measurements or 
preparation-of-state procedures. 

Let us assume that we have (for the considered system) n compatible 
observables represented by the operators A x ,..., A n , winch are complete 
with respect to some simultaneous eigenvector Wq , 

(1.20) AJF o = Aj'ty,, A n W 0 = A$,°V 0 , 

i.e., for which W 0 is, up to a multiplicative constant, the only vector 
satisfying the n eigenvalue equations in (1.20) with the respective eigen¬ 
values Ai 0) ,..., Ajf*. According to Definition 1.1 of compatible observables, 
there must be not only determinative but also preparatory procedures 
which would enable us to prepare the system so as to have at some time 
t 0 a value of A k in the e neighborhood of X { k ] . If A^ 0) ,..., AJJ )) are not 
accumulation points of the spectra of A 1 ,..., A n , respectively, then for 
some e 0 there are in addition to A^. 0) no other points X k in the spectrum of 
A k which lie in the e 0 neighborhood of A£. 0) . Thus, there should be, in 
principle, a sufficiently accurate (e 0 -accurate) preparatory measurement 
which would tell us that the considered system had at time t 0 the value 
A^ for the observable A k , k = 1,..., n. Then the accepted framework 
of quantum mechanics says that the state ¥*( t ), — oo < t < -foo, of 
the system immediately after the preparatory measurement is such that 

(1.21) = yo. 

where W 0 satisfies (1.20). As we shall see in §3.2, the dynamical law 
of quantum mechanics assures us that in this case there is only one 
state Wfjt) satisfying (1.21). 
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An ideal preparatory measurement, like that above, is rarely the case 
in practice. Usually we can establish that the simultaneous values 
\ A n of the n compatible observables A 1 A n are within an 
n-dimensional interval /, or, more generally, within a Borel set B e 3S n 
at some instant t 0 . For instance, we have seen in the Introduction that 
the experimental arrangement on Fig. 2 does not offer us an absolutely 
precise knowledge of the momentum of a charged particle which emerges 
from the apparatus in the interaction region, but it prepares a range A 
of momenta. 

We see that we cannot limit ourselves to the very idealized case treated 
by (1.20). Instead, we should be able to cope with the most general 
imaginable case. To arrive at such a generalization of the case treated 
by (1.20), we shall recast (1.20) in a form which yields itself to straight¬ 
forward generalization. 

The vector W 0 satisfying (1.20) can be characterized as the unique 
vector (up to a multiplicative constant) satisfying 

(1.22) E A ' .' , "({A< 0) } x - X {Af}) W 0 = W 0 , 

where the above projector is given by the formula (1.13) with B ± = 
{Ai 0) },..., B n = {AJ l 0) }. Thus, by Theorem 3.3 in Chapter III, is the 
unique vector (up to a multiplicative constant) in for which 

(1.23) <W 0 1 E A ' . ^({A< 0) } x - X {Ai°>}) y 0 > = || <F 0 1| 2 . 

The characterization of W 0 by (1.22), or equivalently by (1.23), can 
be applied to the problem of characterizing the state of a particle 
prepared by the apparatus described in Fig. 2 (see the Introduction) to 
have at time t 0 its momentum within A . 

We achieve this generalization by introducing the projector E Pl (A), 
which, for instance, in the case of the two-particle system considered 
earlier, can be written as 

E*'{A) = E Vl,v \A x IR 3 ), 

with 2? p i ,p2 given by (1.18). According to (1.22) we can state that this 
two-particle system, with the first particle emerging from the aperture 
0 3 (see Fig. 2) at time t 0 , can be afterwards in any state W(t) which at 
t {) satisfies the relation 


(1.24) 


<!F(< 0 ) | E^(A) W(t 0 )> = || 
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It should be noted that the outlined preparatory measurement 
obviously does not single out a unique state W(t ), but rather a whole set 
of states. This state of affairs is by no means peculiar to quantum mechan¬ 
ics, but is shared by other physical theories. It is due to the ever¬ 
present “errors” of measurement. Thus, when trying to determine* in 
classical physics the state r(t), -~oo < t < - f-oo, of a particle of mass m 
moving in a known force field by measuring its momentum p at some 
instant t 0 , we determine, in fact, only some range A of momenta, since 
“errors” of measurement are always present. After this measurement 
the particle could be in any of the states r(t) for which 

(1.25) mr(t 0 )eA. 

A simultaneous measurement of position of the particle, which would 
locate it at t 0 within the space region A', would reduce the selectable set 
of states to those which in addition to (1.25) also satisfy the relation 

r(t 0 )eA'. 

However, even this measurement will not, strictly speaking, single out 
a unique state r(t) since in practice A and A' are never one-point sets. 

If we return to the quantum mechanical case, it is interesting to note 
that the experimental arrangement in Fig. 2 prepares not only a range A 
of momentum values of the particle, but also a range A' of the position 
coordinates; namely, if a particle has emerged in the interaction region 
during the time interval t 0 to t 0 -f- At while the aperture 0 3 was open, 
the particle must have been somewhere within the confines of that aper¬ 
ture at the instant t 0 . However, the accepted interpretation of quantum 
mechanics deals only with the measurements of compatible observables, 
and cannot take advantage of the additional information that is derived 
from simultaneous measurements of incompatible observables. 

We get the general form of this interpretation for the case of prepara¬ 
tory measurements by straightforward generalization from the special 
case of momentum measurement discussed above. 

Definition 1*4 (Born’s correspondence rule for preparatory measure¬ 
ments ). Suppose that a preparatory measurement of n compatible 
observables, represented by the self-adjoint operators A 1 ,..., A n , has 
been carried out on the system at time t 0 , and that this measurement 
establishes that the simultaneous values X 1 ,..., A n of these observables 

* Since the disturbances of macroscopic systems caused by the impact of photons 
used in the measurement process are negligible, the distinction between preparatory 
and determinative measurement is of no practical consequence in classical mechanics. 
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are within the set B e 38 n at the instant t 0 . Then, after the instant t 0 , 
the system can only be in a state Fft) satisfying at t 0 the relation 

(i .26) = u w(t 0 )\\-KW(t 0 ) | e a - . a \b) v(t 0 )> = l. 

According to the above, a preparatory measurement assigns to the 
system the entire family of all states satisfying (1.26). Since, in general, 
this family contains more than one state, the theory will not predict a 
sharp probality for the outcome within B' e of a 

determinative simultaneous measurement of A^,..., A m ' at time 
h {h > *o)> but rather a whole range of such values. Statistical considera¬ 
tions (to be mentioned in §8 in connection with the “statistical operator”) 
can replace this range by a representative “mean” value. 


1.5. The Stochastic Nature of the Quantum Theory 
of Measurement 


As we see from Definitions 1.3 and 1.4, the only link between the 
formalism and the physical reality is via the probability measures (1.14). 
An experiment consists in first having the system submitted to a prepara¬ 
tory procedure at some instant t 0 and then carrying out a determinative 
measurement at some later instant t x . The Born correspondence rule in 
Definition 1.4 enables us to specify a set of states ¥*(*) as being the states 
in which the system can “find itself” after the preparatory procedure. 
We note that even in the most ideal case we cannot assign to the system 
at time t 0 a single vector W 0 , but rather an entire family {F: W = cF 0 , 
c e C 1 , c ^ 0} of vectors. This explains the reason for the statement that 
if F(t) represents a state, then cF(t ), | c | >0, represents the same state. 

From Born’s correspondence rule for determinative measurements 
(Definition 1.3) we note that the quantum mechanical theory, as opposed 
to the classical theory, does not predict the outcome of a single measure¬ 
ment carried out at some instant t ± > t 0 . The prediction is statistical, 
and as such it refers to a large ensemble of identical measurements rather 
than a single measurement. This means that the preparatory procedure 
should be carried out at the time t 0 on a large sample of N identical and 
independent systems. Then the theory predicts the frequency with 
which a determinative simultaneous measurement of certain compatible 
observables A 1 A n , carried out at time t , will have an outcome within 
the set B ; i.e., if out of the N systems a number v{B) will have at t 
values within B for A 1 A n , then, if the theory is correct, we must 
have the following approximate relationship, 


m 

N 


pd .. A n 

1 'F(t) 


(B), 


(1.27) 
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with an accuracy which increases with an increasing number N of systems 
in the ensemble. 

It is conceivable that in some exceptional* cases a measurement on the 
set A 1 A n of compatible observables, which is complete with respect 
to W Q and satisfies (1.20), can serve not only to determine whether the 
values of A 1 ,..., A n are A[ 0) ,..., A^ 0) , but also as a preparatory procedure 
which leaves the system with the values A^ 0) ,..., AJ l 0) for A x ,..., A n , 
respectively—at least for a short time after the measurement. If the 
system was prior to this measurement in a state ^(J), then after the 
measurement at time t ± it will be in a state 'P'ft) for which ^'(^o) = • 

In this case, the probability 

(1.28) X - x { A< « 0) }) = K^o i «wi* 

can be called the transition probability of having the interference intro¬ 
duced by the process of measurement cause the transition of the system 
from the state *P(t) to the state x F'(t) which is equal to W 0 at t 0 . For this 
reason the expression | ¥ / 2 >| 2 for two normalized vectors 

representing two states at some instant is sometimes called a transition 
probability. 

Exercises 

LL Assume that A i , i = 1,2, are symmetric operators on 
M* = £& Ai = -® 4 2 , with spectral measures E {i) (B), i = 1,2. Show that 
if [£ (1) (£i), £ (2) (£ 2 )] = 0 for all B 1 , B 2 e then [A ± , A 2 ] = 0. 

1*2* Show that if A k , k — 1,2, are commuting bounded symmetric 
operators on Jf, then their spectral measures commute. 

Note. A corresponding statement is by no means true when A 1 , A 2 
are unbounded operators, i.e., when 2 Ai ^ and ^ . In fact, 

it has been shown by Nelson [1959] that there are self-adjoint operators 
with a common dense domain which they leave invariant and on which 
they even commute (being essentially self-adjoint; see §7) but possessing 
noncommuting spectral measures (see also Exercise 5.4). 

L3* Suppose p(x) >0 is a Lebesgue integrable function on IR n . 
Introduce the Lebesgue-Stieltjes measure /x P (B) = p(x) d n x y B e 3§ n . 


* That this is not always so can be seen from the extreme case when the system itself is 
destroyed by the determinative measurement, e.g., by becoming a part of the detector 
(for instance, when the detector is a photographic plate). 
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Show that a Borel measurable function/(#), x G is P'p integrable on 
R™ if and only iif(x) p(x) is Lebesgue integrable on R n , and in this case* 

f f ip^) pfa} d n x — f fix) dp. p {oc). 

J u n J r ” 

L4* Use the result of Exercise 1.3 to establish that the spectral 
measure (E ik) (B)ifj)(x) — Xb{ x ic) *A0*0> x g belongs to the operator 
(Quft)(x) = x k i/j(x), x = x n ), which has the domain of definition 

@Q k = {>p- J K » I **#*)!* d n x < +oo}. 

L5* The spectral measures of are {E Qk (B k )ip)(x) = 

Xb ( x k) 0(#)- Prove that the spectral measure E(B) defined by 
(E(B)i/j)(x) = Xb( x ) *A( x) coincides with E Ql '"' Qn (B). 

L6* Prove that if U is a unitary operator and A is a self-adjoint opera¬ 
tor with the spectral measure E A (B), then the spectral measure of the 
self-adjoint operator A 1 = U^AU is E Al (B) = U^E^B^U. 

2 ♦ Functions of Compatible Observables 

2.1. Fundamental and Nonfundamental Observables 

In §1 we had postulated that in quantum mechanics an observable is 
represented by a self-adjoint operator in a Hilbert space and that a 
state is represented by a vector-valued function x E(t), t e R 1 . Thus, as the 
most basic requirement for a quantum mechanical description of a 
system we have the following. 

Axiom O* To any quantum mechanical system belongs at least 
one complex Hilbert space Jf 7 in which the quantum mechanical theory 
of that system can be formulated. 

If h is a normalized vector in the complex Hilbert space ^f 7 , the set 

h = {ah: a e C 1 , | a | = 1} 

i8 called a ray in We note that if *P 0 is a normalized vector representing 
a state at an instant t 0 , then for any complex number a , | a \ = 1, 
aW Q is another normalized vector representing the same state. These 


* This result can be deduced also from Radon-Nikodym’s theorem (see Halmos 
11950]) once it is observed that p(x) equals the Radon-Nikodym derivative dp. p (x)ld n x. 
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remarks justify the frequently made statement that “a state of a quantum 
system is (at any given instant) a ray in the Hilbert space.” 

In analyzing the concept of observables we have demanded until now 
that an observable play the dual role of being related to a symbol of the 
formalism (a self-adjoint operator, in case of quantum mechanics) and 
at the same time be anchored directly in the experiment by being related 
to some empirical procedure (or procedures) for measuring it. We shall 
call observables satisfying both these conditions fundamental observables , 
to distinguish them as a subfamily of the wider family of observables in 
general, which will be introduced next. 

In order to realize the necessity of enlarging the concept of observables, 
consider the concept of a function of observables in classical mechanics. 
Take, for instance, a one-particle system. If we know the position r 
and momentum p of the particle at some instant, then we can compute 
the values of other observables—such as angular momentum r X p, 
kinetic energy* p 2 /2wz, potential energy V(r) —since all these other 
observables are functions of r and p. More generally, if we take any 
real-valued function F(r, p) in the six variables of r, p, we can think of it 
as an observable, because an indirect measurement of F( r, p) can be 
carried out by measuring the position and momentum of the particle 
and then computing F( r, p). 

The above argument indicates that we can generalize the concept of an 
observable in quantum mechanics by introducing functions of one or 
more already given fundamental observables. However, while in classical 
physics all observables are assumed to be compatible, this is not the case 
in quantum physics. Thus, in quantum mechanics we shall be able to give 
an adequate definition only of functions of a single observable or, at 
most, of a set of compatible observables. 

2.2. Bounded Functions 

of Commuting Self-Adjoint Operators 

The mathematical basis of the concept of a function of observables is 
contained mainly in Theorems 2.1 and 2.3 of the present section. 

Theorem 2*1* Let E An (B) be the spectral measures of n 

commuting self-adjoint operators A x ,..., A n in . Denote by 

E Al . An (B) y B e the spectral measure on 38 n which satisfies the 

relation 

(2.1) E* . An (B 1 x -• X B n ) = E Ai (Bt) ••• E^(B n ) 

# The mass m and the force field characterizing the problem are assumed to be given. 
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for all B 1 B n e BS 1 . If F(X 1 A n ) is a bounded Borel measurable 
function on (R n , 

(2.2) | F( A,A„)| < M, AjA„ € R», 

there is a unique bounded linear operator A satisfying the relation 

(2.3) </ \Ag} = ( F (Aj,..., A„) d(J | 

J R” in 

for all vectors/, g e Jff. This operator A, which is conventionally denoted 
by F(A ± A n ), is bounded and A n )\\ < Af. 

Proof. Since the complex measure 

/*.,(*) = </1 . 

is equal to a sum [see (5.13) of Chapter III] of finite measures, any 
bounded measurable function on IR n is integrable (see Chapter II, 
Theorem 3.9). Hence, the functional 

(f\g)=\ F(X ,,..., A w ) ,..., A w ) 

J u n 

is defined for all /, g e Ji?. This functional is obviously a bilinear form. 
Let us show that (/ | g) is a bounded bilinear form. 

Construct a sequence s 1 , s 2 ,... of complex simple functions on IR n 


n k 

(2»4) *fc(Ai,..., A w ) = ^ &ikXB ik (Xi ,..., A w ), 

i=i 

B ik n Bi’ k = 0 for t ^ i’, 
for which (see Exercise 2.1) 

(2.5) | s k (K A„)| < | F( A x ,..., A„)| < M, 

and for any f, g eJf 

lim f s k (A)d^(A) = f F(A) 

' C ~ ¥ °° J K n J (fJ W 


( 2 . 6 ) 
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An application of (2.5) and of the Schwarz-Cauchy inequality, first in 
and then in l\n k ), yields 

I s kW 
J u n 

n k , 

= ^ a ik H'f.gi^ik) 

i =1 * 


nic 

<£ ! a ik I K/l E Al . An (B ik )g>\ 


< M £ II E* . \B lk )f IIII E* . A «{B ik )g II - 

i=1 

r n k -il/2 r n k 

m[£\\E a ' . \B ik )f ||»] [£ II E A ' . A «{B i 1 c )gf\ 




= M 


(/I f . A *{B ik )f} | 1/2 | J £* HB ik )g) 


1/2 


The above inequality implies, when combined with (2.6), that 

l(/l*)l <M||/||||*||. 


Thus, (/1 g) is a bounded bilinear form, and therefore by a straight¬ 
forward application of the Riesz theorem (Chapter III, Theorem 2.3; 
see Chapter III, Exercise 2.5), we infer the existence of a linear operator 
A for which 


<f\Ag> = (f\g) 


and || ^ II < M. Q.E.D. 

An operator A for which a Borel measurable, bounded* function 
F(X 1 ,..., A n ) and commuting self-adjoint operators A 1 ,..., A n can be 
found such that A — F(A ± ,..., A n ) is said to be a function of the 
operators A 1 ,..., A n . It is important to note that for two different 
functions ,..., X n ) and F 2 (X 1 ,..., X n ) we shall have F 1 (A 1 A n ) = 


The case of unbounded functions is going to be treated in Section 2.4. 
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F 2 (A x A n ) in the case that these two functions coincide when 
X 1 e G S' 4 ” (see Exercise 2.2). Thus, we can define F(A 1 A n ) 

even in the case when the function F(X 1 ,..., A n ) is defined only on the set 
S^i x ••• X S A *, and is bounded and Borel measurable on 
S*i X ... x S An . We can do that by extending F(X 1 ,..., A n ) to a bounded 
function on IR n ; this can be done, for instance, by setting F(X 1 ,..., A n ) = 0 
when (X 1 ,..., A n ) <£ S A * X ••• X S A «. 

In the above considerations we have allowed F( Aj,..., A n ) to be a 
complex function. We shall establish that F(A 1 ,..., A n ) is self-adjoint 
when^Ai,..., A n ) is real by proving Theorems 2.2 and 2.3. 

Theorem 2*2* Let {A t ,..., A n } be a set of commuting self-adjoint 
operators. If F(X 1 ,..., A n ) is a complex-valued, bounded, Borel measurable 
function on U n and 7 r *(A 1 ,..., A n ) is its complex conjugate, then 
F*^ ,..., A n ) is the adjoint of F(A 1 ,..., A n ). 

Proof . According to Theorem 2.1 

</1 F(A t A n )g> = f F( A) d(J | E* **>. 

J M n 

The same theorem applied to F*(A 1 ,..., A n ) yields 

<£ | F*(A,A n )f > = f F*( A) dig | Ef A «fy. 

J u n 

Thus, we get 


= f m d<g i . A r>* 

J r” 

= f F(X) diEf . A »f\g) 

J R” 

= f F(X) dif | E Al .V 

J R” 

= <f\F(A 1 ,...,A n )g) 

for all/, g e This shows that F*(A 1 ,..., A n ) = \F{A 1 ,..., ^4 n )]*. 

Q.E.D. 

Theorem 23* If F(X 1 A n ) is a real bounded Borel measurable 
function on R n and A x A n are n commuting self-adjoint operators, 
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then the operator A = F(A 1 A n ) is self-adjoint and its spectral 
measure E A (B) satisfies the relation 

(2.7) E A (B) = E a . . \F~\B)) 

for all B e 3B 1 . 

Proof. If F(X 1 A n ) = jP*(A 1 A n ), we have according to the 
preceding theorem 

[F{A 1 A n )Y = F^A X A n ) = F(A X 

Thus, the operator A = ^ n ) * s self-adjoint. 

The projector-valued set function 

(2.8) E^\B) = .^(F- 1 ^)) 

is obviously a spectral measure on We shall prove that for all /, £ 

(2-9) <J\Ag) = f Ai</|£f^>, 

J r 1 

and that consequently the spectral measure E iF) (B) defined in (2.8) is 
indeed the spectral measure of A. 

From (2.3) we see that the relation (2.9) is equivalent to the relation 

(2.10) f F(X, A.) d(J | = f X d(f | Ei F) g). 

In the case whenF^ ,..., A n ) = >•••> B 1 e^ n , an easy computa¬ 

tion shows that (2.10) reduces to the relation (2.8), which is true by 
definition. Hence, we can immediately conclude that (2.10) is also true 
when F(X 1 ,..., A n ) is a simple function. From this result, (2.10) can be 
established straightforwardly first for F(X 1 ,..., A n ) nonnegative, and 
afterward for any real function F(X 1 ,..., A n ), by employing Definitions 
3.4 and 3.5 in Chapter II for integration on measure spaces. Q.E.D. 

2.3. Algebras of Compatible Observables 

Suppose that the self-adjoint operators A 1 ,..., A n represent compatible 
observables. If we carry out at some instant t a simultaneous measure¬ 
ment of these observables, we will determine, ideally, a measure 
—in the case that the system was in the state t ) at that 
instant. Thus, we can also compute the measure 

PndB) = p^vr^H-B)). 


(2.11) 
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It is then natural to say that the measure (2.11) provides the 
distribution of values of the observable represented by the operator 
A = F(A 1 A n ). This ruling is in agreement with (1.14), since by 
Theorem 2.3 we find that P^ {t) (B), defined by (2.11), satisfies indeed the 
following relation: 

p a wU) {B) = II m\\- 2 <m I e\B) w(t)y. 

We shall say that A represents a bounded observable which is a function 
of A x ,..., A n . The family of all such observables has an algebraic 
structure, which can be established on the basis of Theorem 2.4. 

Theorem 2*4. If F i (X 1 ,..., A n ), i = 1, 2, are two bounded Borel 
measurable functions on (R n and A x ,..., A n are commuting self-adjoint 
operators, then 

(2.12) (F 1 + F 2 )(A 1 ,..., A n ) = F X (A X A n ) +F 2 (A 1 ,..., A n ), 

(2.13) (F x -F^A ,,..., A n ) = ^,..., A n )F 2 (A 1 ,..., ^ tt ). 

Proof. Since F x and jF 2 are bounded functions, their sum F 1 -f- F 2 
and product F 1 • F 2 are also bounded. In addition, F x -f- F 2 and F x • jP 2 
are Borel measurable according to Theorem 3.5 of Chapter II so that 
the operators 

A =(F 1 +F 2 )(A 1 ,...,A n ), 

B = (F 1 -FJ(A 1 ,...,A n ), 
are defined by Theorem 2.1. 

We easily obtain (2.12) by using the definition of A as a function of 
A x ,..., A n to derive that for all/, g e Jf 7 , 

</1 Ag) = (FA A„) + FA A.)) rf</1 ^V"Vv?> 

= </ I A n)g> + </ I ..... A n)g>- 


In order to prove (2.13), write first 

(2.14) </1 F X (A X A n ) F 2 (A x A n )g} 

-= f F,(A t A„) d(f | E^yyfF^A, . 4 n )*>. 

iratt 1* n 
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Now we have for any B e 

<f\E* . An {B) F 2 {A 1 A n )g) 

= <E A1 An (B)f I F 2 (A 1 ,...,A n )g> 

= f F&i A,) i<E* .^(£)/|<V.3>> 

J K in 

J B in 

because the complex measure 

= <E* . \B)f\E A ' . ^(BJg) 

obviously vanishes on sets B 1 lying outside B. Consequently (see 
Chapter III, Exercise 6.7) 

f F i(*i ,-,K) d(J | E Al —y An F 2 (A 1 A n )g) 

J R w 1« 

= f m, x k )f 2 (\ 1 x n ) d(j i v?> 

J R n in 

= a\{F 1 -F 2 )(A 1 ,...,A n )g'), 


which, when combined with (2.14), shows that (2.13) is true. Q.E.D. 

We note that albeit in general we cannot add or multiply unbounded 
operators without paying close attention to domain questions (cf. 
Chapter III, §2.6), we do not run into similar problems with bounded 
functions of self-adjoint operators A 1 ,..., A n representing compatible 
observables. Denote by & h (A 1 ,..., A n ) the family of all bounded self- 
adjoint functions of A 1 ,..., A n . If a is a real number ^nd F(A 1 ,..., A n ) e 
(9 h (Ai ,..., A n ) y then obviously also ( aF)(A 1 ,..., A n ) e (P h (A 1 ,..., A n ). 
Moreover, if F 1 (A 1 ,..., A n ), F 2 (A 1 ,..., A n ) e (P h (A 1 ,..., A n ), then on 
account of Theorem 2.4, 

F\{A 1 ,..., A n ) -\-F 2 {A 1 ,..., A n ) = (-F x -\-F^){A 1 ,..., A n ) e&x ) (A 1 ,..., A n ) f 

F 1 (A 1 ,..., AJF^ ,..., A n ) = (F,-F^A ,,..., A^e^A,A n ). 

Thus, @- b (A 1 ,..., A n ) is a real algebra, usually referred to as the algebra 
generated by the observables A x ,..., A n . Furthermore, @\>(A 1 A n ) 
is obviously a commutative algebra. 
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In studying the relation between theory and experiment, it is often 
convenient to consider instead of the original observables A n 

the algebra & h (A 1 A n ). The reason for this is that, while some of the 
operators A 1 A n will be, generally speaking, unbounded and there¬ 
fore not defined on the entire Hilbert space, the operators in 
& h (A 1 A n ) are bounded and defined on the entire Hilbert space. 
On the other hand, & h (A 1 A n ) supplies all the physical information 
that A x A n do. The reason for this is that this information is 
contained exclusively in the expressions 

(2.15) . An (B)W(t)), 

where E Al ’" ,,An (B) are bounded functions of A 1 ,..., A n , 

(2.16) £*. a *(B) = X b(A A n ), 

as can be seen by taking for F(X^ ,..., AA in (2.3) the characteristic function 
Xb(K»~>K) of the set B e 88 n . 

If A is an operator representing an observable and W(t) e 2 A , then 
the expression 

(2.17) (A) nt) = (W(t) | AV(t)\ || V(t)\\ = h 

is called the expectation value of A at time t. In§l we saw [see (1.15)] that 
in the case of a determinative measurement of A at time t> (2.17) is the 
mean value of the statistical sample obtained as an outcome of that mea¬ 
surement if the system was in the state l f / (t) at time t. 

According to (2.15) and (2.16) we always have 

(2.18) P^r An (B) = (x B (A ± ,..., , || W(t )|| = 1. 

This prompts us to state that a knowledge of the expectation values of all 
self-adjoint bounded functions of the observables A 1 ,..., A n is equivalent 
to a knowledge of the statistical distribution of the simultaneous value 
of A x ,..., A n . Hence we shall call such functions also observables. 

*2.4. Unbounded Functions 

of Commuting Self-Adjoint Operators 

We shall study now the general concept of a function F{A 1 ,..., A n ) of 
n self-adjoint commuting operators A ± ,..., A n for the case when F( A), 
A 6 R n , is any Borel measurable function on R n . 
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Theorem 2*5* Let A 1 A n be n commuting self-adjoint operators 
in and E Al ’--’ An (B) y B e the spectral measure defined by (2.1). 
If F(A), A e R n y is a Borel measurable function, then there is a unique 
linear operator A which satisfies 

(2.19) <£ | Afy = f F( A) d<g I Ef ^»/> 

J u n 

for all g e 3#*. This operator is defined on the domain 

(2.20) a A = (/: J | 2?(A)|» d\\E* . An f II 2 < oo j; 

A is usually denoted by F(A 1 A n ). 

Proof. Denote by R the set of all vectors / e for which the linear 
functional 

<pAg) = f n F*(X)<KE? . ^f\ g y 

J U n 

exists for all g e 3#* and is continuous. For each such/ e R there is by the 
theorem of Riesz (Chapter III, Theorem 2.3) a vector h such that 

<h\g} = <p f (g), 

or equivalently, and with the notation h = A(f), 

<g\A(f)}=( F(X) d(g | E Al . A «f>. 

J u n 

The set R is linear, because if f x , / 2 e R y we can derive for any g e , 
by using the rules of integration, 

«i f m d <s i E t . An A> + f f( a) d( g i 

= f m d <g I Ef .^"(aJi + aj 2 )\ 

i.e., a,/, -f- a,,f 2 e R\ the above relation can also be written as 

<h(g I + a 2<g I A (f 2 )} 

= <g I M(A) + M(/*)> 

= <g I A(aJ 1 + aj 2 )) 


for all g e StF, which implies that A is linear. 
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In order to show that R coincides with the set 2 A defined in (2.20), let 
us show first that / e R implies that / e@ A . Taking in (2.19) g = Af, 
we get 

II 4 /T = f md<Af\E* . A iy 

= f F(X) d(E Al . A f\Af>\ 

and since we have (see the proof of Theorem 2.4) 

<£*. A ”(B)f 14/')= f F(\') d(E Al . a \B) f I E A r . A ”f> 

* ra* 

= f FQt)d<J\E£ . A i), 

J B 

we obtain/e 2 A (see Chapter III, Exercise 6.7): 

M/ll 2 = f |^(A)| 2 ^II< 1 .^/ll 2 <+oo. 

J u n 

Conversely, let us assume that the above relation holds for some / e Jf, 
and let us show that/e R . Choose a sequence of simple functions ^(A), 
s 2 (A),... such that (see Exercise 2.1) 

(2.21) | Re ^(A)| < | Re * a (A)| < • < | Re F(A)|, 

| Im^(A)| < | Im s 2 (X)\ < ••• < | ImF(A)|, 

lim s k (X) = F( A), A e R n . 

If we write ^(A) in the form (2.4), we can derive for any g e Jj? 

Tig 

(2.22) £ | Re «« ||<* I#* 1 . A "(B ik )f)\ 

i =1 

n « 

< I I a ik III E A - . A "(B ik )f\\ II E* . A *{B ik )g II 

i-1 

\l/2 ,n k \ 1/2 

< (E I a ik I s || E A ' .■ 4 "(£i*)/ll 2 ) (E II E A ' . A «{B ik )g Ip) . 

'i-i ' '<-i 














280 


IV* The Axiomatic Structure of Quantum Mechanics 


The expressions on the left-hand side of (2.22) constitute a convergent 
sequence since they are monotonically increasing for k — 1, 2,..., while 
the right-hand side is bounded by 

a vl/2 / r \ 1/2 

^ | .F(A)| 2 d || Ef .^/|| 2 ) • ( j d || E? .II 2 ) 

a \l/2 

R JF(A)| 2 rf||£^.'VH 2 ) • 

As this observation stays true if Re a ik is replaced in (2.22) by Im a ik , 
we arrive at the existence of the limit 


n k r 

lim X a ik (g | E a ' . A \B ik )f) = I F( A) d(g | Ef .^/>. 


This argument establishes, incidentally, the following inequality: 


f F(X)d(g\E A - 






Since the above argument stands for any g e Jf, it follows that/e R. 

Q.E.D. 

It should be noted that, in general, the domain of definition of 
F(A ±,..., A n ) is not dense in JHowever, in the case that this domain is 
dense and F( A) is real, it follows from Theorem 2.6 that F(A 1 ,..., A n ) 
is self-adjoint. 

^Theorem 2*6* If A x ,..., A n commute and A = F(A 1 ,..., A n ) has 
a dense domain of definition, then A* — F*(A 1 ,..., A n ), where the 
function F*(A) is the complex conjugate of F( A). 

Proof. Consider the operator Z), which is defined as that operator 
which satisfies 


<J\Dg>=\ F*(\) d(f | E Al . A ”g> 

J u n 

for all/e^f 7 . It is obvious from Theorem 2.5, especially from (2.20), 
that the domain of D is identical to t2f A . If we take /, g e Q) A = , 

we get 

</1 Dg> = (J un m d<g | Ef' . A «fy)* 

= <g\Af'>* = <Af\g\ 
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which shows that D C A*. We shall prove that D = A* by showing that 
Assume that h eS> A * and h* = A*h, so that the relation 


(2.23) 


<h | Af> = <A* |/> 


is valid for all f e 3) A . Consider the bounded and Borel measurable 
function 


F m ( A) = 


10 


if |F(A)| < M 
if |F(A)| > M. 


According to Theorem 2.1, the operator A M * — F M *(A 1 ,..., A n ) is 
bounded. We easily see that A M *h e 3 A , 


f \F{Wd\\Et . A «A M *h\? 

J n” 

= f \F(\)\*d\\A M *Ef .'‘•AH* 

J re" 

= J r . I m \ 2 d* [/ H . I FM 2 d u II E* . A *E A ' . Anh II 2 ] 

= f |F(A)F M (A)|*d||Ff.'‘•AH* 

J K 

= j |F*(A)|*d||£*. An h II 2 < +oo. 

Thus, we can insert in (2.23)/ = A M *h, and by taking into account that 
A m * and E Ax, — ,A * commute (see Exercise 2.4), we obtain 

<A | AA M *h} 

= f F( A) d(h I E A ' . A ”A M *hy 

= f -F(A) dQi I A m *E? . A »h) 

= j^F( A) d x [J k /m*(A') dy<k I E A ) . a «e a > . A «hy] 

= f F(\)F M *(\)d\\Ef' .'‘•AH* 

J U n 

= f | F m (A)|* d II E+ . A «h II* = II A M *h II*. 

J (F8 n 
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From (2.23) and the above equation we get 

II A M *h || 2 = </** | A M *h> < || h* || || A M *h ||, 
which implies that 

a \l/2 

I ^m(A)| 2 d || E? . An h II*) = || A M *h || < || h* ||. 

If we let M —► + oo, then the left-hand side in the above relation stays 
bounded, and since 

UmF M (X)=F(X), 

iVa->oo 

the integral 

f iw^ii ^ 1 . ^\\ 2 

j n » 

exists and is finite (see Chapter II, Theorem 3.10). Thus, h e Qf D , which 
establishes that @ A * C 2 D . Q.E.D. 

We can generalize the concept of a function of compatible observables 
A x ,..., A n by including all real Borel measurable functions F( A) which 
are such that the set (2.20) is dense in Jff. Theorem 2.6 guarantees that 
the operator F(A 1 ,..., A n ) representing such a function will be self- 
adjoint, and therefore we can call it observable. As a generalization of 
Theorem 2.4 to the unbounded case we prove Theorem 2.7. 

Theorem 2*7* If F i (A 1 ,..., A n ), i = 1,2, are any two functions of 
the commuting self-adjoint operators A 1 ,..., A n , then 

(2.24) F X (A ,..., A n ) + F 2 (A x ,..., A n ) C (F 1 + F 2 )(A 1 ,..., A n ), 

(2.25) F 1 (A 1 ,..., AJF^A, ,..., A n ) C (F, • F 2 )(A X ,..., A n ). 

For any function F(A 1 ,..., A n ) and any integer m ^ 0 we have 

(2.26) [F(A 1 A n )F = (F m )(A 1 ,..., A n ). 

Proof . It is easy to see that 

®F X ^ Q ^Fi+F 2 

since whenever i^A) and F 2 ( A) are square integrable with respect to a 
measure 


/*/(*) = || E Al ' 


\B)f || 2 , 
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the function (F 1 + F 2 )(X) is also square integrable (see Chapter II, 
Theorem 4.2). If we introduce the functions 

F m = TO if \m)\<M 

m{> |0 if I ^(A)| > M, 

then we have by Theorem 2.4 

>•••> A n )f -\-F iM (AiA „)/ = (F Uf + i< 2Af )(ylj A n )f. 

By letting M —*■ oo and noting that for / e ii O we have 

-4«)/ = A n )f, i = 1, 2, 

Af-»oo 

(^1 +^2X^1 >•••> ^w)/ ~ “f“ ^2Af)(^l >•••> ^n)/» 

we get in the limit 

F X {A X ,..., A n )f + F 2 (A, A n )f = (F, + F 2 )(A 1 ^ n )/. 

We turn now to proving (2.25). By the already familiar procedure of 
dealing with spectral-resolution integrals, we get 

f | ^i(A)| 2 d II Ef 1 .^ re )/ II 2 

J R w 

= f I ^iWI 2 dA f | F &)| 2 d u || E? A i || 2 

J |j^« L J |j^« 

= f |F 1 (A)*' l (A)| 2 rfJ|E*. A »ff. 

J K « 

The above relation shows that whenever F 2 (A 1 ,..., A n )f belongs to the 
domain of definition oiF 1 {A 1 ,..., A n ), so thati^^,..., A n )F 2 (A 1 ,..., A n )f 
is defined, then / belongs to the domain of (F 1 • F 2 )(A 1 ,..., A n ). If we 
take such a vector /, and apply Theorem 2.4 to F 1M ^A X ,..., A n ) and 
^2 M 2 (A 1 ,..., A n ) to obtain 

(2.27) F 1M± (A 1 ,..., A n )F 2 Mz (A 1 ,..., A n )f — (F 1Ml • F 2M2 )(A 1 ,..., A n )f, 

we get by taking the strong limits, first for M 2 —> oo [note that the opera- 
t° r Fim {Ax ,..., A n ) is bounded and therefore continuous] and then 
for M 1 -> oo, 

F^A ,,..., AJFJA ,,..., A n )f = (F 1 • F 2 )(A 1 ,..., A n )f. 


Thus, (2.25) is established. 
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From (2.25) it follows directly that 

[WAi ,.,A n ) 

for any m = 0, 1, 2,... . However, when m ^ 2, the square integrability 
of (F{ A)) w for some measure implies the square integrability of 
(F(A)) wl ~ 1 , since | F(A)| 2(W ~ 1) ^ | F(A)| 2m at any point A where | F(A)| ^ 1. 
This means that &> F m C @ pm -i . Hence, by taking in (2.27) F x = F and 
F 2 — F m_1 , we get for / e pm C 2 F m-\ in the limit M 2 —> + 00 

F Mi (A ,,..., AJF^A ,,..., A n )f— (FJ^ 1 )(A 1 ,..., O/ 

By letting M x + oo we arrive at the result that FF m ~ x f — F m f and 
that 3) FF m-\ D 3)pm , i.e., FF m ~OF m . Thus, by induction in m 

[F(A X ,...,A)r2(F-)(A ,...,^ n ), 

i.e., (2.26) holds. Q.E.D. 

The relation (2.26) indicates that the definition of functions of observ¬ 
ables given in Theorem 2.5 coincides with the algebraic definition 
whenever the algebraic definition is feasible. In fact, the function of n 
commuting operators A x ,..., A n corresponding to a polynomial of n 
real variables \ ,..., A n is equal to the operator obtained by replacing X t 
by A i for i — 1,..., n everywhere in the polynomial. In particular, if 

P{ A) = d n X n + a n _ iA n_1 + ••• + a x A + a Q 

and A* = A, and if p(A) is defined by 

(2.28) </1 p(A)g) = f p( A) d(f | Efgy, feJf, ge ® An , 

J M 1 

it is easy to establish that 

P(A) = a n A n + cin^A™” 1 + ••• + a x A + a Q 

by using the linearity property of the integral in (2.28) and the relation 
(2.26) with F(A) = A. 

Exercises 


2*L Prove that if F{ A), A e (R n , is Borel measurable on IR n and /x(B), 
B e is a complex measure, then there is a sequence of simple func- 
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tions that satisfy (2.21) with sign Re F{X) — sign Re s fc (A), sign Im 
F(A) = sign Im ^(A), and F(X) is /x integrable if and only if 

)}™S K « Sk{X)dix{X) 

exists, in which case the above limit is equal to 

f F(X)d^(X). 

2*2* Let A 1 A n be n commuting self-adjoint operators and 
F( A), A g R n , a function which vanishes on S^ 1 X • • • X S^«. Show that 
F(AiA n ) = 0. 

2*3* The proof of Theorem 2.5 is independent of Theorem 2.1. 
Derive Theorem 2.1 from Theorem 2.5. 

2*4* Show that in general the symbol C cannot be replaced in (2.24) 
and (2.25) by the symbol =. 

2*5* Show that any two bounded functions F^A-^A n ) and 
F 2 {A 1 ,..., A n ) of A 1 ,..., A n commute. 

2*6* Prove that to each spectral function E x , A g R 1 , corresponds a 
unique self-adjoint operator A such that 

<g 140= f 

J u 1 

for all g g Jrf? and for all / g 9 a . 

2*7* Show that if £) n are the self-adjoint multiplication 

operators (Q k i /r)(*i x n ) = x k ^{x 1 ,..., x n ) inL 2 (R n ), thenF^ Q n ) 

coincides with the operator A defined by 

W)(*l vj x n ) = F(x 1 ,..., x n ) ip{x 1 x n ), 

9 A = {^:F-</,gL 2 (R”)}. 

3* The Schroedinger, Heisenberg, 
and Interaction Pictures 

3.1. The General Form of the Schroedinger Equation 

We have seen in § §1 and 2 that in order to “describe” a system 
quantum mechanically, we have to associate with it a Hilbert space 
(§2, Axiom 0). The observables are represented by self-adjoint operators 
in while the physical states are vector-valued functions ¥*( t ) in the 
time variable t e R 1 . Clearly, not every vector-valued function can be a 
state, because in that case—as can be easily seen from the Born corre- 
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spondence rules in §1—the theory would lack predictive power. We 
must require that a function *F(t ) which represents a state should 
satisfy a dynamical law (see the Introduction). This law is given in 
quantum mechanics by an equation of motion, called the Schroedinger 
equation. 

We can arrive at the general form of this equation by generalizing the 
Schroedinger equations of Chapter I, §5 and Chapter II, §5. The essen¬ 
tial feature of those special cases was the existence of an operator //, 
which in those cases was the Schroedinger operator [essentially given by 
the Schroedinger operator form in (5.8) of Chapter II] and it was also 
related to the total energy observable. 

More generally, we postulate for each system which can be described 
quantum mechanically, the existence of a self-adjoint operator H y 
commonly called the Hamiltonian of the system . This operator is assumed 
to play a double role in the theory. On one hand, it represents the total 
energy observable, and it is therefore related to measurements of the 
energy of the entire system. On the other hand, it plays a key role in the 
equation of motion; namely, as a generalization of (5.5) in Chapter II, 
we postulate that a state satisfies the Schroedinger equation 

(3.1) ih = H¥(t). 

There are, however, two mathematical questions which a careful look 
at the above equation immediately brings to mind. 

The first question is concerned with the meaning of the derivative 

(3.1) . With some foresight (see also Exercise 3.1), we answer that ques¬ 
tion by defining 

T = ** T, + 

The second question is related to the fact that since H is usually an 
unbounded operator, we cannot find a solution of (3.1) for every state *F 0 
prepared at some instant t , since we have to require that e ^H> 
while ^ We can solve this last difficulty by formulating the 
dynamical law by other means than the equation of motion (3.1). 

3.2. The Evolution Operator 

Consider the function exp(—(s’/#) A(£ — £ 0 )), which is continuous 
(and therefore Borel measurable) as well as bounded for A e R 1 . Due to 
these properties, we can define a bounded operator 

(3.2) U(t 9 1 0 ) = exp[-(i7*) H{t - t 0 )l 
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customarily called the time-evolution operator. In accordance with 
Theorem 2.1, U(t , t 0 ) is the operator satisfying 

(3.3) </1 U(t, t 0 )g} = f exp(—(*/*) A (t - t 0 )) d(J | E xg} 

J u 1 

for all /, g e where E x is the spectral function of //, 

(3.4) H = f A dE x . 

J u 1 

We shall prove a theorem from which the main properties of U(t , t 0 ) 
can be derived immediately. In proving that theorem we need two 
lemmas that will find many later applications. 

*Lcmma 3*1 ♦ ( Lebesgue’s dominated , or bounded , convergence 
theorem). Let j^(f), / 2 (£),... be a sequence of nonnegative functions 
defined on the measurable set R of a measure space (^, /x) and 

integrable on R. Suppose that lim n -*oo/ n (£) =/(£) almost everywhere. 
If there is a function g( |) integrable on i? and such that | f n (£) I 
for all n — 1, 2,..., then/(£) is integrable on R and 


lim f m) mo = f 

n > co j £ j £ 

Proof. By Theorem 3.4 in Chapter II, /(£) is measurable so that 

*»(£) = :k = n,n+ 1,...}, » = 1, 2,..., 

are also measurable (see Chapter II, Exercise 3.3). Since g n (£) < 2 m, 
the functions g n (£) are integrable on R (see Chapter II, Theorem 3.9). 
Furthermore, lim nH>00 £ n (|) = 0 almost everywhere on R. Since ^(1) ^ 
g 2 (£) ^ ••*, we can apply Theorem 3.10 of Chapter II to derive 

lim f g n (()M0 = 0. 

n-± oo j g 

We get the desired result by observing that |/ n (£) — /(£) I <g n {€). 

Q.E.D. 

*Lcmma 3*2 (. Patou's lemma). If /i(£), ACfJv is a sequence of 
nonnegative functions, integrable on the subset R of measure space 
( 9C y s/, [jl) and converging almost everywhere to a function/(|), and if 

f f n (€)dp(€)<M 

j R 
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for all ft, then/(|) is integrable on R and 

f mwexM. 

J R 

Proof '. Consider the functions 


gn(0 = inf{/ w (0,/ w+1 (0,...} 

which are integrable on R (see Chapter II, Exercise 3.3) and note that 
/(£) = Hindoo £ n (£) almost everywhere in R. By applying Theorem 3.10 
in Chapter II to the sequence£ x (£), we infer that/(|) is integrable 

on R and 


f = gniQMO- 

J R 7Z-»00 J £ 

Since g n (€) </ n (£) f° r all values of ft, we have 

f g n (i)d^)<M, 

J R 

which establishes that M is a bound for the integral of/(£) over R. 

Q.E.D. 

Theorem 3*1* If A is a self-adjoint operator in Jf, the operator¬ 
valued function 

(3.5) U t = e iAt , t e R\ 
has the following properties: 

(a) U 0 = 1; 

(b) U t is a unitary operator on for all t e IR 1 ; 

(c) U t U ti = U ii+k for all t 1 ,t 2 e R 1 ; 

(d) if fe& A , 

(3.6) iAf = s-lim U *f~f 

v 7 J t-±0 t 

and, conversely, the above strong limit exists only if / e 3 ) A . 

Proof, (a) If E x is the spectral function of A 

A=(\dE x , 
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then by Theorem 2.1 

(3-7) </l U t g> = f e^dif\E,gy. 

J K 1 

When we insert above t = 0, we obtain 

<f\u 0 g}= f d(f\E Ag y = (f\g} 

J R 1 

for all /, g e J which proves that U 0 = 1. 

(b) According to Theorem 4.3 in Chapter III, U t is unitary if it 
satisfies the relations 

(3.8) U t *U t = U t U t * s 1. 

The above relation can be established immediately by noting that 


and using Theorems 2.2 and 2.4. 

(c) By using Theorem 2.4 we get 

</l u h u hg y = f 

J U 1 

= f e uih+h) d<f\E !ig y 

J R 1 

= </l U t 1+h g> 

for all/, g e , which proves that U t U t — U t +i . 

(d) The domain of definition of the operator 

(3-9) (Ht)(U t -l)-iA 

is 9$ A since U t is defined everywhere on 30*. The operator (3.9) is a 
function of A, corresponding to the complex function 

(3.10) F t (\) = (1/*)(«“* — 1) — i\. 

For / e 3> a we have by Theorems 2.5 and 2.7 

= </| (^-ij-^^-i)-*)/) 

J R l I l \ 


(3.11) 
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Since we have by the mean value theorem of the differential calculus 

(1 lt)(e m — 1) — /A(cos A Orf + i sin A 0 2 t), 0 ^ 6 12 ^ 1, 

we get for all t ^ 0 and A e R 1 , 

I F*(A)| 2 < | A | 2 1 cos A 0 x t + i sin A 6 2 t — 1 | 2 < 4 | A | 2 . 

Thus, 1 F f (A)| 2 is a function majorized for t ^ 0 by the function 4 | A | 2 , 
which is integrable in the measure /x^B) = </ | £(B)/>, and for which 
| JF^A)! 2 = 0. Consequently, by Lemma 3.1 we have 

limJ Ri |F t (A)|^</|£ A /> = 0, 

and we can conclude from (3.11) that (3.6) is true. 

Assume now that 

(3.12) It)(U t - 1 )g 

exists for some g e Due to Theorems 2.1, 2.2, and 2.4 we can write 

(3.13) | \ (U t - 1 )g f = (g | ^ (U t - 1W. - l)f) 

= j Ri I 7 (e iA ‘ - 1) f dig \ E x g). 

The existence of the strong limit (3.12) implies (see Chapter III, 
Exercise 5.5) the existence of a constant M such that 

11(1 lt)(U t - 1)/|| 2 < M 

for all t e R 1 . Since in (3.13) 

lim |(1 lt)(e™ - 1)| 2 - A 2 , 


we get by applying Fatou’s lemma (Lemma 3.2) that A 2 is integrable with 
respect to the measure /x a (B) — | E(B)gy. According to the spectral 

theorem in Chapter III, §6, this implies that g e Q) A . Q.E.D. 

Theorem 3.1 shows that the operator U(t> t 0 ) in (3.2) is unitary. 
Furthermore, from (3.6) it follows that iff then for any t e R 1 


s-lim 

t-*t 0 


U(t , t 0 ) ~ 1 , i m 
t - to f ~h Hf - 


(3.14) 
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The operator U(t , t 0 ) obviously leaves 3f H invariant, i.e., if / e , then 
U(t, t 0 )f e , for all t> t 0 e R 1 . This can be seen by applying the 
spectral theorem, and noting that [E x , U(t, * 0 )] = 0, to obtain 

f A 2 ^ || E a U(t , f 0 )/||* = f A*rf|| U(t 9 t 0 ) EJ\\* 

J us 1 •'r 

= f *d\\EJ\\K 

J R 1 


From Theorem 3.1c we easily get 


( 3 . 15 ) U(t 9 t 0 )U(t 09 tJ = U(t 9 tJ. 

Combining (3.14) and (3.15), we derive that for any/e^ H 


« i- W ‘.«.)/ - a s-lim + f 

dt v ’ At^o At J 

= ih s-lim U{ ' t + A *' ~ 1 U(t, t 0 )f 

At^O At v 0JJ 

= HU(t , t 0 )f. 


Thus, if W 0 e H , then 


V{t)= U(t 9 t 0 )V 0 

satisfies the Schroedinger equation (3.1), and since U(t 0 , t 0 ) ~ 1, it 
also satisfies the initial condition 

W(t 0 ) = U(t 0 , t 0 ) V 0 = V 0 . 

We shall see in a later section that *?( t ) is, as a matter of fact, the only 
vector-valued function of t satisfying these conditions and the require¬ 
ment that || ^(011 = II II f° r a U t G R 1 - 

3.3. The Schroedinger Picture 

The formalism of quantum mechanics espoused in the last two sections 
has the striking feature that a state is represented by a vector-valued 
function W(t) in the time variable t y while the observables are represented 
by self-adjoint operators which do not depend on t. This formalism for 
quantum mechanics is called the Schroedinger picture . 

The statement that in the Schroedinger picture the observables are 
represented by operators which do not depend on t has to be accepted 
with one qualification; namely, there are cases when the Hamiltonian of 
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the system depends on t y i.e., is given by H(t). Such instances will occur 
when a system S interacts with another system Sj, and we know how the 
interaction term in the Hamiltonian depends on t. In such a case it is 
very often impractical to study the behavior of the compound system 
® + ©i > when, for instance, Q 1 is very large* and complicated. 

We shall summarize the characteristic features of the Schroedinger 
picture in the following axioms, in such a way as to include the possibility 
of the most general case when the Hamiltonian is time dependent. 

Axiom SI* A Hamiltonian H(t) is assigned to every quantum 
mechanical system; H(t) is an operator-valued function of the time 
parameter t. For each value of t e (R 1 , H(t) is a self-adjoint operator in the 
Hilbert space 34? associated with the system (see §2, Axiom O) and H(t) 
represents the total energy of the system. 

Axiom S2* All the observables of the system that are not 
functions of the energy are represented by self-adjoint operators in 34? 
which do not depend on t. Two or more observables are compatible if 
and only if their corresponding operators commute. 

Axiom S3* There is a unitary operator-valued function U(t , £ 0 ), 
t , t 0 e (R 1 , called the time-evolution operator , which is such that 

(3.16) U(t 09 t 0 )= 1, t 0 eU\ 

(3.17) U(t, h) [/& , = U(t, t 0 ) 9 t 9 t l 9 t 0 e R\ 


and is strongly continuous in t f t 0 e [R 1 , and for all f e @ H ( t ) , 


(3.18) 


ih s-lim —- / = ih s-lim -L 

h -*t t x — t t — t 2 




Axiom S4* A state of the system is represented by vector-valued 
functions x F(t) which satisfy the relation 

m = u(t 9 t 0 )v(t 0 ) 9 nm)ii>o, 

for all t y t 0 e (R 1 . If c is a nonzero complex number, } f / (t) and cV^) 
represent the same state. 


* Such a case occurs when Si is an electromagnet producing a magnetic field which 
varies with time. It is true that in principle we could study the system 0 + by 
considering the electromagnet itself to be a many-body system consisting of many atoms, 
electrons, ions, etc., but such an approach would be hopeless to pursue from the computa¬ 
tional point of view. Instead, we could measure the microscopic electromagnetic field 
produced by the electromagnet, and treat the problem by introducing in the Hamiltonian 
of the system 0 a term which is a function of that field, and represents, to a good 
approximation, the interaction of the system and the electromagnet. 



3. The Schroedinger, Heisenberg, and Interaction Pictures 


293 


We have seen that the conditions (3.16)—(3.18) are satisfied by the 
operator-valued function (3.2) when H(t) is the same for all t e U 1 . 
A certain generalization of (3.2) can be obtained when H(t) depends on 
time in some regular manner, but the considerations become quite 
involved if the operators H(t i) and H(t 2 ) do not commute for t ± ^ t 2 . 
In the time-independent case it is easy to see that (see Exercise 2.5) 

(3.19) pf w (B) = <m i e*(b) ii mw = i. 

is time independent. The physical meaning of this result is that the 
probability of measuring a certain value of the energy of the system in 
state ¥*( t ) is the same at all times, i.e., that the energy is conserved. 
Naturally, if H(t) is intrinsically time dependent, this statement does not 
stay true in general. Physically, this is due to the interaction of the system 
with an external source in the course of which energy can be transferred 
to or from the source (see also Section 7.9). 

3.4. The Heisenberg Picture and Physical Equivalence of 
Formalisms 

Let A be a bounded operator representing an observable, and let 

(3.20) <A) nt) = <U(t, 0)V\AU(t, OyP) 
be the expectation value of that observable in the state 

(3.21) W(t) = U(t , 0)^, & = ^(0). 

As we have seen in §2, the link between the quantum mechanical forma¬ 
lism and experiment is contained solely in the expectation values (3.20) 
of all possible bounded observables, so that two formalisms in which 
all expectation values are the same are physically equivalent. Since we 
can write (3.20) in the form 


(3.22) 

<A> ni) =<^| A(tfP\ 

(3.23) 

A(t) = U~ 1 (t,0)AU(t,0), 


we could describe the same physical state of the system by giving the 
vector and the operator-valued functions A(t) in (3.23) for all bounded 
observables of the system. This formalism of quantum mechanics is 
known as the Heisenberg picture . 
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If D is an unbounded operator representing an observable, the 
operators 

f)(t) = 0) DU(t , 0) 

are self-adjoint for all t e U 1 because U(t y 0) is unitary (see Chapter III, 
Lemma 4.3), and represent the same observable in the Heisenberg picture 
as D did in the Schroedinger picture. 

If the Hamiltonian H of the system is time independent, we have 


H(t) = e +a/mm H e -Himm = H 


since e~ {ilh)Ht leaves 3) n invariant and commutes with H . Thus, in that 
case ft(t) will be time independent (in physicists’ language, the energy is 
conserved) and given by the same operator as in Schroedinger picture. 

In the more general case of time-dependent Hamiltonian H(t ), we 
have in the Heisenberg picture 

(3.24) H(t) = U-\t, 0) H(i) U(t , 0). 

In the Schroedinger picture the dynamical law was the Schroedinger 
equation (3.1), which had to be satisfied by the state X P(t ). Analogously, 
in the Heisenberg picture the dynamical law is embodied in an equation 
of motion, which has to be satisfied by any operator function A(t ) 
representing an observable. Since the link between the representations 
of states and observables in the two pictures is given by (3.21), (3.23), 
and (3.24), we can derive the equations of motion in the Heisenberg 
picture from the equations of motion in the Schroedinger picture. 

Theorem 3*2* Let A(t) represent an observable in the Heisenberg 
picture, and assume that the same observable is represented by a bounded 
operator A in the Schroedinger picture. If/and A(t)f are in the domain 
of definition of /?(*) for all t e U\ then A(t)f satisfies the Heisenberg 
equation 

(3.25) ih 


Proof ‘ By definition, 

dt J ^«->o At J 


(3.26) 



3. The Schroedinger, Heisenberg, and Interaction Pictures 


295 


Straightforward algebra yields 

A(t + At) — A(t) 


(3.27) 


At 

U~\t + At, 0) AU(t + At, 0) - U-\t, 0) AU(t, 0) 


U~\t + At, 0) A 


At 

U(t + At, 0) - U(t, 0) 
At 


+ 


U~\t + At, 0) - U-\t, 0) 


At A U(t, 0). 

Whenwe get from (3.18) 

(3.28) ih s-lim + U(t,0) f = H(f) ^ Q)f 

By using the unitarity of TJ~\t + At, 0) we obtain 

(3.29) 

\\(u-'(t + At, 0) A U{ - ± At 'V ~ ^ 0) + Au-\t, 0) ^H(0 U(t, 0))/|| 

ii iT7(t 4- At — 77f7 i 


^ ^(t + _ ^, o) —c/M) + f H(t) U{ft q^i 


+ 


I(i t/-i(t + At, 0) — ^ 0)) AH{t) U(t, 0)/|. 


The first term on the right-hand side of (3.29) converges to zero due to 
(3.28) and the fact that A is bounded and therefore continuous (see 
Chapter III, Theorem 1.3), while the second term converges to zero 
since, due to (3.17) and (3.16), 

(3.30) U-\t x , t 2 ) = U(t 2 , t,) 

for any , t 2 e IR 1 , and therefore 


(3.31) 


s-lim( U~\t + At, 0) - U~\t, 0)) 

= s-lim(t/(0, t + At) - U( 0, <)) = 0. 


Thus, we see that 


(3.32) 


s-lim U~'{t + At, 0) A 


U(t + At, 0) - U(t, 0) 
At 1 


= -- \ V-K*. 0) AH{t) U(t, 0)/ = - | A(t) fl(t)f. 
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If AU(t y 0)fe^ H ( t ) y or equivalently, A(t)f eS)^ t ) , we derive by 
employing (3.18) and (3.30) 


(3.33) 


-li m U-Kt + M,0)-U-Kt,0) AU{t ' 


Jfn>0 


o)/ 


At 




= l m t) H{t) AU(t, 0)/ = i #(*) 40/- 


Now we can obtain (3.25) by applying (3.27) to/and using (3.32) and 

(3.33) when taking the strong limits for At —>0. Q.E.D. 

The Heisenberg picture is a formulation of quantum mechanics 
which is physically equivalent* to that of the Schroedinger picture. 
Consequently, its main features are derivable from Axioms S1-S4. 

In order to formulate the dynamics in the Heisenberg picture inde¬ 
pendently of the Schroedinger picture, we introduce the unitary operators 

(3.34) 0(t) - U(t, 0). 

For C(t)fe @ H ( t ) , i.e., when/e^^) , (3.17) and (3.18) yield 

ih s-lim U(t + ~ 0{t) f = Hit) Uit)f = (Jit) Hit)f. 


Furthermore, the relation (3.23) can be recast in the form 

(3.35) A(t) = A( 0) U(t). 

The above relations do not contain any direct reference to the Schroe¬ 
dinger picture. Thus, we can now characterize the Heisenberg picture as 
the formalism fulfilling the following Axioms H1-H4. 

Axiom HL A Hamiltonian fl(t) is assigned to every quantum 
mechanical system. l2(t) is an operator-valued function of the time 
parameter t. For each value of t e R 1 , /?(£) is a self-adjoint operator in 
the Hilbert space associated with the system. fi{t) represents the total 
energy of the system at time t. 


* The physical equivalence of these two pictures was established by Schroedinger 
[1926b] only after both these pictures had been independently developed by 
Heisenberg [1925], and Schroedinger [1926a], in the matrix and wave mechanics versions, 
respectively. 
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Axiom H2* A state of the system is represented by a time-inde¬ 
pendent nonzero vector *P e if c =4 0, cW represents the same state. 

Axiom H3* There is a unitary and strongly continuous operator¬ 
valued function 0(t) such that 

(3.36) 0(0) = 1, 

and such that for all 

(3.37) ih SjUrn ^ - U(t )-f = O(t) H(t)f. 

Axiom H4♦ An observable is represented by an operator-valued 
function A(t ), where for each t e U 1 , A(t) is a self-adjoint operator in Jf 7 . 
Moreover, A(t) satisfies the equation 

(3.38) A(t) = U~\t) A( 0) £/(*). 

3.5. The Formalism of Matrix Mechanics 

The Heisenberg picture and the Heisenberg equation (3.25) were first 
introduced by Heisenberg in the matrix mechanics version of quantum 
mechanics. We can obtain the matrix formulation of quantum mechanics 
for any particular system by choosing in the Hilbert space assigned 
to that system an orthonormal basis* {e 1 , e 2 ,...} which is contained in 
the domain of definition of all operator functions A(t) representing 
observables. To each A(t) we assign an infinite matrix || A ik (t )|| (see also 
Exercise 3.2), where 

A ik (t) = <e { | A(t) e k ), i,k = 1, 2,... . 

To a state represented by the normalized vector we assign an infinite 
one-column matrix || c k ||, where 

C t = <e k | #>, k = 1 , 2 . 

Note that || c k || e / 2 (oo) and that 

(Aityp), = < ei | A(tyPy = <A(t) | Vy 

= E e i i **><«* i 

/c-l 


= E o* I A f ) e kXe k i tf') = E • 


fc-1 


1 


# is in practice infinite dimensional and separable. 
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Thus, || A ik (t )|| can be considered to be an operator in / 2 (oo), which 
acts on the elements of / 2 (oo) according to the rules of matrix multiplica¬ 
tion. 

The Heisenberg matrix picture can be useful when the Hamiltonian 
ft of the system is time independent and the elements {e 1 , e 2 can 
be chosen to be eigenvectors of ft, i.e., if we can write e k — W k where 

HW k = E k W k , k = 1,2,..., 

(an example of such a system is the quantum mechanical harmonic 
oscillator; see Exercise 3.3). In that case f)(t) is represented by the 
matrix with components 

Ojt) = | e- {ilK){it V k> = . 

Consequently, noting that = 0*(t) 9 we get from (3.38) 

Mi) = £ M(t) A mn ( 0) U nk {t) = A ik ( 0) e ii/n)(E <- E > u . 

m,n—1 

3.6. The Interaction Picture 

The physical equivalence of the Schroedinger and Heisenberg pictures 
is based on the fact that the physically significant quantities, i.e., the 
expectation values (3.20), are the same in both pictures. When the Hamil¬ 
tonian is time independent, the Schroedinger picture is characterized by 
the property that the vectors representing states are time dependent, 
while the operators representing observables are time independent; in 
the Heisenberg picture the opposite is true regarding time dependence. 

It is clear that one can invent a large number of “intermediate’ ’ 
pictures “between” these two pictures, which would all preserve the 
expectation values, but in which the time dependence will be partially 
assigned to vectors representing states, and partially to operators repre¬ 
senting observables. 

The most significant example of such an intermediate picture is the 
interaction picture, which is useful when the Hamiltonian H(t) of the 
system can be written in the form 

(3.39) H = H 0 + V(t), 

where H 0 is the “free” Hamiltonian of the system (usually) describing 
a system without interactions; for instance, in the case of the w,-particle 
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system of Chapter II, §5, H 0 is the self-adjoint operator defined by the 
differential operator form 


while V is the multiplication operator defined by the potential 
F(rirj. 

In order to arrive at the interaction representation for Hamiltonians 

(3.39) , we write the expectation value (3.20) in the form 

(3.40) ' (A) wit) = <P(t) | A(t) 9{t)y, 

where 


(3.41) A(t) = e (i ‘ '»** A e - u,n)H ^, 

(3.42) ¥(t) = e U/n)H <’ t U(t, OfP; 

A(t) and l P{t) are taken to represent in the interaction picture the observ- 
able and the state which were represented in the Schroedinger picture 
by A and ^(t) = U(t, 0)W, respectively. In the case where V(t) is time 
independent, (3.42) becomes 


(3.43) 


m 


_ e -{i/n)Ht\p 


If we apply Theorem 3.2 to (3.41), we obtain that for any/ei^ c 

(3.44) *^jp-f=[A{t),HMi 


the above relation is the Heisenberg equation for the “free” system. 

Let us introduce the operator-valued function 

(3.45) H Y {t) = e {i,h)HQt V(t) e - {i,n)H ^\ 

The vector function W(t) representing a state in the interaction picture 
satisfies a Schroedinger equation with the Hamiltonian //,(£), 

(3.46) m ^) = Hl (t)9(t) 
if 9{t) e & Hl (t) (see Exercise 3.4). 
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Exercises 

3In formulating the Schroedinger equation in wave mechanics 
(see Chapter II, §5) for a state of n particles given by a wave function 
i fj(x, t ), x e (R 3n , we have been using the derivative ifr t (x , t) — d t i/j(x y t ), 


ihi/j t (x, t) = (H s tf;)(x 9 1), 
instead of the strong limit of (3.1). Show that 


s-lim ^ f + At )- 0(*> *) 
At-* o At 


t ) 


in the case when ifj t (x y t) is continuous in t and Lebesgue square integrable 
in x. 

3*2* Assume that {e 1 , e 2 ,...} is an orthonormal basis in the infinite- 
dimensional Hilbert space Jf 7 , and that A, B y ... are all the linear operators 
in which are such that their domains of definition as well as the 
domains of definition of their adjoints contain the set { e 1 , e 2 ,...}. Assign 
to each of the operator A, B,... the infinite matrices 

11-^ifclUl^fcll,-, = 1,2,..., 

where A ik = (e i \ Ae k }, etc. Show that if e 1 , e 2 ,... are in the domains 
of A + B and AB , then (A + B) ik = A ik + B ik and (AB) ik = 

33* Use the results of Chapter I, Exercise 5.4 to compute the matrix 
elements % ik (t) and P ik (t) of the harmonic-oscillator position operators 
Jt(t) and momentum operators P(t) in the basis {W 1 , W 2 ,...} of energy 
eigenfunctions. 

3*4* Prove that the vector function !?( t) representing a state in the 
interaction picture and satisfying the relation *P( t) — e {x ^ )H ° l U(t, 0) ^(0) 
also satisfies the equation 

ih *m = 

if *?(£) e n @ H ( t ) and Hj(t) is given by (3.45). 

3*5♦ Show that if A is a self-adjoint operator in and if 

+«])/ = / for some real a > 0, then 


e iAtf = 


s-lim Y 

n-*+co 


(iAtf 

k\ 


oo 

/=£ 


fc -0 


(iAt) k , 
~kT } ' 



4* State Vectors and Observables of Compound Systems 


301 


3*6* A well-known phenomenon in wave mechanics is the evanes¬ 
cence of wave packets. For example, if H Q is the kinetic energy Schroe- 
dinger operator in L 2 ((R 3 ), given by (H 0 ifj)~(p) = (p 2 /2m) i/*(p), then it 
can be proved* that 



c —u/*)tf o < 0)( r )|2 dr =0 


if B 0 is any bounded Borel set in (R 3 . Prove that for any open bounded 
set B 0 the expansion 

e -mt ^ = £ *(r) e «*>(*„), 

fc=o n ' 

is incorrect [though (H 0 n i/j)(r) = (— (fi 2 /2m)A) n i/r(r) is well defined, 
since i/r(r) is infinitely many times differentiable and of bounded support] 
because it contradicts the evanescence property of e~( ilh> > HQt i fj. 


4* State Vectors and Observables of Compound Systems 

4.1. Superselection Rules and State Vectors 

Until now we have been talking mainly in terms of the Schroedinger 
picture of quantum mechanics, in which the observables are represented 
by time-independent self-adjoint operators, while the states are vector¬ 
valued functions of t. In §3 we also encountered, however, the Heisenberg 
picture, in which the observables are represented by time-dependent 
operator-valued functions, while the states are time-independent vectors, 
as well as other pictures of quantum mechanics. In each of these pictures 
we can define the concept of state vector as a vector W which can represent, 
at least at one instant t , some state of the system. The natural question 
to ask is whether all the vectors in the Hilbert space assigned to the 
system (§2, Axiom O) are state vectors. 

It is immediately clear from the last section that the zero vector 0 can 
never be a state vector. Inasmuch as we do not adopt the requirement that 
a state vector be normalized (a request which is a matter of mere 
convenience), we can say that it was generally believed until the early 
1950s that all the other vectors in are state vectors in any quantum 
mechanical theory. A paper by Wick et al. [1952] dispelled that belief 
by pointing out the existence of superselection rules in quantum field 
theories. 


# See Exercise 7. 7 in Chapter V. 
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The presently known superselection rules are those of charge, baryon 
number, and univalence. These superselection rules state essentially 
that the Hilbert space is a (in general infinite) direct sum of so-called 
coherent subspaces , M 2 and that only the nonzero vectors belonging 
to these subspaces are state vectors. However, these superselection rules 
are primarily of significance only in theories treating systems with a 
variable number of particles, for instance in field theories. Thus, the 
charge superselection rule states that if and W 2 are eigenvectors of the 
charge operator with eigenvalues e x and e 2 , respectively, where e 1 ^ e 2 , 
then afP 1 + ^ 2^2 * s n °t a state vector if a 1 ^ 0 and a 2 ^ 0. Since we are 
primarily concerned with nonrelativistic quantum mechanics, in which 
the nature of the particles and their number does not change, the 
existence of the above-mentioned superselection rules will not be of any 
immediate concern to us. 

4.2. The Hilbert Space of Compound Systems 

The main task of quantum mechanics is to study the interaction of 
different parts of a system. Nonrelativistic quantum mechanics deals 
with the case when the nature of these parts remains unchanged during 
the entire duration of the experiment, so that each one of these parts can 
be thought of as a stable “particle.”* The question which naturally 
arises is whether we know how to construct the formalism for a more 
complex system if we know which are the formalisms of the constituent 
particles. 

In the case that a system © consists of n different other systems 
(e.g., “particles”) ,..., © n with which the Hilbert spaces ,..., 
are associated, respectively, then the rule is to associate with © the 
Hilbert tensor product ^ ® • • • ® . Let us assume, in addi¬ 

tion, that at a certain instant t we know that the n systems Q 1 ,..., © n are 
in states represented at that instant by the state vectors e ,..., 
x F n e , respectively. Then quantum mechanics rules that the entire 
system © is at that instant in a state represented by the state vector 
^V® ••• ® n • Since in this case we can talk about the state vectors 
of the individual systems © x ,..., © n separately, we say that © x ,..., © n 
move independently at time t. However, in general, the state of © will 

* It should be noted that the use of the word “particle” does not preclude the possibility 
of an intricate internal structure. Thus, an a-particle consists of two protons and two 
neutrons, a deuteron consists of one proton and one neutron, etc. In fact, there is a 
tendency in modern elementary-particle physics to treat all particles on an equal footing, 
and talk about the transformation or decomposition of one particle into other particles 
rather than of breaking up of a particle into constituent parts. 



4* State Vectors and Observables of Compound Systems 


303 


not be representable at later times by a tensor product of n vectors 
from , respectively. Due to the interaction of ©j © n , 

the state vectors representing the state of © at later times can be vectors 
of a more general form in 

4.3. Tensor Products of Linear Operators 

Let the operator A k be an observable of the system © fc . In the case 
where © fc is an independent part of the system ©, we can certainly 
measure A k . Hence, this question arises: which mathematical entity 
related to the Hilbert space Jff — ^ ® • • • ® represents this 
observable. In order to answer this question, we have to define the 
following concept. 

Definition 4*1 ♦ Let A x ,..., A n be n bounded linear operators acting 
on the Hilbert spaces , respectively. The tensor product 

A x ® ••• ® A n of these n operators is that bounded linear operator on 
^ ® ••• ® c W n which acts on a vector ® ••• ®f n , f x e ^ ,..., 
f n e , in the following way: 

(4.1) (A x ® ••• ® AhX/l ® ••• ®/ n ) = AJ X ® ••• ® A n f n . 

The above relation (4.1) determines the operator A x ® ••• ® A n on 
the set of all vectors of the form f x ® ••• ®f n , and therefore, due to the 
presupposed linearity of A 1 ® ••• ® A n , on the linear manifold spanned 
by all such vectors. Since this linear manifold is dense in ® ••• ® J^ n 
and the operator defined by (4.1) in this manifold is bounded (see Exer¬ 
cise 4.1), it has a unique extension to ® *** ® • Consequently, the 

above definition is consistent. 

Coming back now to the earlier posed question of how to represent in 
^ 0 ••• ®, W n an observable of © fc representable in by the 
bounded operator A k , we see that the natural candidate is 

(4.2) A k = 1 ® ••• ® 1 ® A k (x) 1 ® ••• ® 1. 

The fact that this is the right choice can be established by verifying that 
the physically meaningful quantities, i.e., the expectation values (2.17), 
are the same for A k when © fc is an isolated system in the state W k at t , 
or when <5 k is at the instant t an independent part of ©, which is at t 
in the state W x 0 ••• 0 W k ® ••• ® l F n ; namely, if we take the nor¬ 
malized state vectors W 1 ,..., W n , then 

<??! ® - ® ¥n\ (1 ® - ® A ® - 01)^0 - 0 ^ w > 

- <^i I ^>1 - <n I A k W h \ - <W n | ^ M > = (V k | A k W k } k . 
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We note that in Definition 4.1 we defined A x (x) ••• (x) A n only in the 
case when A 1 A n are bounded. In case of unbounded operators 
difficulties appear with the domain of definition of A 1 ,..., A n . If the 
domains of definition of A 1 ,..., A n are 2# A ,..., 2# A , respectively, then 
we can define A 1 (x) ••• (x) uniquely by (4.1) at least on the algebraic 
tensor product 2$ A (x) a ••• . However, if A 1 A n were self- 

adjoint unbounded operators, then the A 1 (x) ••• ® A n so defined might 
not be self-adjoint when defined on such a domain. 

When we are trying to define the operator A k which would be a 
generalization of (4.2) to the case when A k is unbounded, we can cir¬ 
cumvent the above-mentioned difficulty by considering the spectral 
measure E Ak (B) of A k . Since 

(4.3) E k (B) = 1 ® ® E Ak (B) (x) ••• (x) 1, Be 2# 1 , 

is also a spectral measure of projectors in (see Exercise 4.5) we can 
define A k as the self-adjoint operator with the spectral measure (4.3) 
(see Exercise 2.6). 

4.4. The Observables of a System of Distinct Particles 

Let (9 1 ,..., (9 n be the sets of observables corresponding to the systems 
Si,..., ® n , respectively, and denote by & 1 ,..., & n the same observables 
represented by operators in (x) • • • ® , such as the operators 

(4.2) in the case of bounded observables. Naturally, it is true that the 
set (9 of observables of the compound system S contains U • • • U & n . 
However, in general it contains additional observables. 

One way of obtaining additional observables is by taking functions of 
compatible observables from & 1 U ••• U & n . For instance, let H k be the 
operator in M ? k representing the energy observable of the system <Z k , 
i.e., the Hamiltonian of <£ k , and let E Hk (B) be its spectral measure. 
Denote by H k the energy operator of in i.e., the self-adjoint 
operator with a spectral measure built from E Hk (B) according to (4.3). 
It is very easy to check that the spectral measures of H i and Hj commute, 
i.e., that H i and Hj are compatible for any i, j = I,..., n. Hence, 
according to Theorem 2.7, the operator H 1 + + $n is self- 

adjoint. This operator obviously represents the total energy of the 
system 6 in the case when Q 1 ,..., ® n are independent, since it gives us 
the probability of obtaining energy values E ± + ••• + E n for S when 
the probabilities for the energies E x ,..., E n of Sj,..., S n , respectively, 
are known. 
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The operator H which represents the total energy of the system 
® = {®! ® n } is usually taken to be of the form 

H3B 1 + -+g n +v, 

where V is a self-adjoint operator referred to as the interaction part of H . 
Indeed, the presence of V reflects, from the physical point of view, 
the existence of interaction between the n parts ® n of ®. 

Let us illustrate the above consideration with the case of n different 
particles without spin, which was treated in §5 of Chapter II in the 
context of wave mechanics. In that case was taken to be L 2 ((R 3n ), and 
we can take ^ = ••• = = L 2 (1R 3 ). The space L 2 ((R 3n ) is isomorphic 

to L 2 (R 3 ) ® ••• ®L 2 ([R 3 ) (n factors), and according to Theorem 6.9 
of Chapter II, the mapping 

(4.4) </'i(r 1 ) ® •" ® M*n) 'Ai(fi) ••• 0»(r») 

induces a unitary transformation U between these two Hilbert spaces. 
The operator H k is in this case given essentially by the differential oper¬ 
ator form —(fi 2 l2m k ) A k , and therefore ft 1 + ”■ + &n is essentially 
—((^ 2 /2m 1 ) A, + ••• + (fi 2 l2m n ) A n ), and represents in this case the 
kinetic energy of the system ®. The interaction term V is determined by 
the potential V(r 1 ,..., r n ), 

W)(rir w ) = K(r x r w ) <A(r,..., r w ). 

We note that H corresponds to the Schroedinger operator form (5.8) in 
Chapter II. 

4.5. Symmetric and Antisymmetric Tensor Products 
of Hilbert Spaces 

We have emphasized many times that our considerations on compound 
systems did not apply, up to this point, to systems containing identical 
particles. A widening of the scope of our considerations can be achieved 
only after the introduction of some additional mathematical material on 
tensor products. 

Definition 4*2* The closed linear subspace of the Hilbert space 
M' = ® • • • ® ^ = • • • = 2/f n , spanned by all the vectors 

of the form 

£ A, ® ®fk n , fl . fn e > 

(A!| I ... , k n ) 


(4.5) A ® S - ® S /n = ^ 
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where the sum is taken over all the permutations k n ) of ( 1 ,..., n), 

is called the symmetric tensor product of , and it is denoted by 

^ (x) s ••• (x) s Jtr n or 2>ef n . 

Similarly, in the case of the set of all vectors in & of the form 


(4.6) 

/i ® A - ® A /„ = 

1 y 

Vnl 



A >•••>/« e 

where 



(4.7) 

w(*i K) 

-K 


7r(&i k n )f ki (X) ••• ®/* 

.*»> 

* L, 


if (Aj k n ) is even 

if {k x k n ) is odd, 


n 9 


the closed linear subspace spanned by this set is called the antisymmetric 
tensor product of Jfi, and it is denoted by 



The factor (n!) _1/2 in (4.5) and (4.6) has been introduced for the sake 
of convenience in dealing with orthonormal bases (see Exercise 4.2). 
In case that f x ,...,/ n are orthogonal to each other and normalized in 
, then due to this factor, the symmetric and antisymmetric tensor 
products of these vectors will be also normalized. 

If we take the inner product of a vector (4.5) with a vector (4.6) we 
get zero. This implies that is orthogonal to (see Exercise 4.3), 

when these spaces are treated as subspaces of 

If A 1 ,..., A n are bounded linear operators on M y 1 , then A 1 (x) ••• ® A n 
will not leave, in general, Jtp Sn or invariant. We can define, 

however, the symmetric tensor product 

(4.8) A x ® s ••• ® s A n = £ A ki ® — ®A kn 

(*!.* n > 

which leaves and invariant, and which can be considered 

therefore as defining operators on these respective spaces, by restricting 
the domain of definition of this operator to these spaces. 


4.6. The Connection between Spin and Statistics 

Let us consider now a system S of n identical particles of spin a. 
We recall from the Introduction that the spin projection in some direction 
can assume only integer and half-integer values. If is the Hilbert space 
associated with each particle in this system, then not every nonzero 
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vector in is a state vector. It is an assumption of quantum 

mechanics, which is consistent with the known experimental material, 
that the Hilbert space of state vectors associated with S is the sym¬ 
metric tensor product in the case that the particles have integer 

spin, and the antisymmetric tensor product in the case that the 

particles have half-integer spin. 

The above assumption is based, primarily, on the statistics of gases. 
A given macroscopic volume of gas can be considered to be a system S 
with a large number n of identicle particles which are, in the first approx¬ 
imation, independent. Well-known statistical consideration, starting 
with a basic assumption of equal a priori probability for each particle to 
have its energy within a certain range, lead to a certain distribution of 
particles (energy distribution) along the energy spectrum. This procedure 
involves counting the number of all possible states of the system which 
are compatible with a certain energy and some other exterior conditions. 
It has been observed that the assumption that the state W of the system 
(e.g., gas) S in which particle S fc is in the state W k and particle in 
the state W t is identical to the state of S in which <5 k is in the state W\ 
and S j in the state W k leads to a correct energy distribution in the case 
of particles of integer spin. This assumption is usually referred to as the 
principle of indistinguishability of identical particles in quantum mechanics, 
and the statistical considerations based on it are referred to as the Bose- 
Einstein statistics. It reflects the fact that the symmetric tensor product 
X F 1 ® s • • • ® s *F n does not change under a permutation of W k and W t . 

In the case of a gas consisting of particles of half-integer spin, it has 
been observed that one must add to the assumption of indistinguishability 
of particles the assumption that no two particles can be in the same state. 
This last assumption is referred to as the Pauli exclusion principle (which 
we have met already in Chapter II, §7) and the statistics based on it is 
called the Fermi-Dirac statistics. The Pauli exclusion principle reflects 
the mathematical feature of the antisymmetric tensor product 
X P 1 ® A ••• ® AX F n that it is obviously equal to the zero vector if two 
vectors W k and W l in it are identical for some k =4 /. 

The indistinguishability of identical particles is reflected in the further 
natural assumption that those observables A of the system S which are 
built from observables of individual free particles, are symmetric tensor 
products (4.8) of these. Thus, in measuring the expectation values <A 
of some particle, we cannot say whether it refers to the first, second,..., 
or nth particle, since obviously for any such A 
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Let E(B), B g SB 1 ' be the spectral measure of some observable of 
a single particle from our system of identical particles. Then 
1 

n \ E(B) (x) s ••• ® s E(B) (n factors) 

is a projector, as follows from the fact that it is self-adjoint (see Exercise 
4.4) and idempotent (see Exercise 4.5), 

(E(B) ® - ® E(B)Y = (E(B))* (x) - (x) ( E(B)f = E(B) <g> - ® E(B). 

Similarly, by using the definition (4.8) and Exercise 4.5, it is easy to 
check that 

(4.9) £(£0 ® s - ® S E(B n ), B { r\ B } — 0 for i ^ j, 

is self-adjoint and idempotent, and therefore it is a projector. More 
generally, if in the family 

(4.10) ,..., B ^ Ui ,..., Bfci ,..., B kUk , ^1 ~f~ *“ ~f~ ftjc == 

of n Borel sets, any two sets with the same first index are identical while 
any two sets with different first indices are disjoint, then 

(4.11) [W ® s - © s E(B ln j] ® s - 

n l 1 n k- 

®s [E(B kl ) ®s ... 0 s E{Bknk)] 

is a projector (see Exercise 4.6). Thus, the expectation value of (4.11) for 
some state W(t) in the Schroedinger picture gives us the probability of 
having n i particles within the sets B ix = ••• = B in for each i — 1,..., k. 
However, if B x ,..., B n are not pairwise disjoint, then the operator (4.9) 
is not, in general, a projector, and its expectation value does not yield the 
probability of finding at some instant one particle in B x , one particle in 
B 2 , etc. This probability is given by the expectation value of a projector 
which is the sum of projectors of the form (4.11), obtained by writing 
B x (x) ••• (x) B n as a union of disjoint sets of the form (4.10) and adding 
the operators (4.11) corresponding to these sets. 

4.7. Spin and Statistics for the a-Body Problem 

As an illustration of the above consideration on the connection between 
spin and statistics we shall formulate the wave mechanics for n identical 
particles of spin or. Wave mechanics postulates that a state vector will be 
represented in this case by a wavefunction 

(4.12) 0(r x r w , * w ), r k e [R 3 , 


s k = —a, —a + 1,..., a, 
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which is Lebesgue square integrable and symmetric 

(4.13) </<(..., r i , s t ,..., Tj , s, ,...) = <A(..., r i , s j ,..., r,, s t ,...) 
if cr is an integer, or antisymmetric 

(4.14) J VJ J v) = 0(***J J ^3 V*j *•••) 

if cr is half-integer. It is easy to see that the Hilbert spaces of functions 
(4.13) and (4.14) in which the inner product is taken to be 

(4.15) <f\g>= £ - £ f 3 f^(r l9 s l9 ... 9 r n9 s n ) 

s 1= - a s n =—a J K 3 ” 

X g( r x , ,..., r n , s n ) dri ... dr n , 

are unitarily equivalent under the unitary transformation induced by 
the mapping 

(4.16) 0(r x , s x ) (x) ••• (x) </>( r n , s n ) -► 0(r x , *i) — <A(r w , * w ) 

to the spaces and respectively, where (see Exercise 4.7) 

(4.17) = L 2 (IR 3 ) ® ••• ©L 2 ([R 3 ) (2cj + 1 terms). 

The Hamiltonian of the system is usually taken to be of the form 

HDT S + V, 

where T s is the kinetic energy operator essentially given (see Theorem 
7.3) by (—H 2 l2m)(A 1 -f ••• -f A n ). Thus, T s is already symmetric with 
respect to the n particles. If V is the potential energy given by a potential 
V(r 1 , s i ,•••, r n , s n)y then the principle of indistinguishability of identical 
particles requires that this function is symmetric under any permutation 
of the indices 1 ,..., n. 

We shall now summarize the main conclusions reached in this section 
in the following axioms. 

Axiom PL Let S by a system consisting of k • n particles Si 1 *,..., 
S*, 1 *,..., Si**,..., S^*, where the particles carrying the same superscript 
are of the same kind, while those carrying different superscripts belong to 
distinct species. If Jt {v) is the Hilbert space associated with each one of 
the particles Si y) ,..., Sj,^ > v = K > then the Hilbert space for S is 
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where 0 should be taken to be the symmetric tensor product if the 
corresponding systems are of integer spin, or the antisymmetric tensor 
product if the corresponding systems are of half-integral spin. 

Axiom P2* In the absence of superselection rules, every nonzero 
vector in is a state vector. 

Axiom P3* If at some instant t the particles S} 1 *,..., ©^j are free 
and in the states l / / { 1) ,..., , respectively, then the state of © at that 

instant is represented in the Schroedinger picture by 

(4.19) ® ® <>) 0 - 0 

Axiom P4* If E is a projector in representing in the Schroe¬ 
dinger picture an observable of the particle ©^ when that system is free, 
then the same observable is represented in Jf 7 by the projector 

(4.20) 10-0 10 [E0_-_0 E 

n v factors 

n V ~1 /ft \ —1 1 

+ £ ( r ’) . E 0 s - 0 s E 0S (1 - £)0S • • • 0S(1 - E ). ] 010-01, 
r=1 n v — r factors r factors 

where the operator in brackets acts in (Jf ,(l ' ) )® n *'. 

It is easy to derive from (4.20) the expression for any projector related 
to measurements on the system ©, when the constituent parts of © are 
independent (see Exercise 4.8). In particular, we can arrive at the expres¬ 
sion (4.11) for measurements on systems of identical particles. 

Exercises 

4*1 ♦ Prove that the linear operator A x (x) ••• (x) A n on the separable 
Hilbert space (x) ••• (x) has the bound \\ A 1 \\ ••• || A n ||. 

4*2* Show that if e ± , e 2 ,... is an orthonormal basis in , then 
{e ki ® s ••• ® s e kn : k x ,..., k n = 1,2,...} is an orthogonal basis in 
whereas {e ki (x) A ••• (x) A e kn : ,..., == 1, 2,...} is an orthonormal 

basis in ^f Kn . 

4 3* Prove that the subspaces and of are mutually 

orthogonal. 

4 A* Show that if A x ,..., A n are self-adjoint bounded operators on 
,..., , respectively, then A ± (x) ••• (x) A n is also self-adjoint and 

bounded. 
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4*5* Prove that for any bounded linear operators A ± A n and 
B ± B n , and any scalar a 

a(A x ® — ® A) = (a^) (x) — ® A n = ••• = A 1 ® ••• <g) (a^ n ), 

(A + ^i) (8) A 2 ® ••• (x) 

= A (8) ^2 ® — (§) + Bi (g A 2 (X) — ® A n , 

(A (g) — ® A n )(B x (g) — (g) B n ) = (A#i) (g — <g) (^A)- 

Show that the first two of the above relations are true also for symmetric 
tensor products, while the third one is not true, in general, for symmetric 
tensor products. 

4*6* Use (4.8) and the results of Exercises 4.4 and 4.5 to prove that 
(4.11) is a projector. 

4*7* Prove that the spaces £ a 2 (IR 3n ) 8ym and £ a 2 (IR 3n ) anti consisting, 
respectively, of all the symmetric (4.13) and antisymmetric (4.14) func¬ 
tions which are Lebesgue square integrable in the variables r x ,..., r n 
are unitarily equivalent to and respectively [with given 

in (4.17)], under the linear transformation induced by (4.16). 

4*8* If E {v) (B) is the spectral measure of some observable of the 
system = ••• = ©j^ (see Axiom PI), use Axiom P4 to build the 
projector whose expectation value gives the probability of observing 
simultaneously one particle in B ( ^\ one in 2?* 10 , etc. for all v = 1,..., k , 
when the particles in © are independent. 

4*9* Let ZJ 1 ,..., U n be unitary transformations of the Hilbert spaces 
onto the Hilbert spaces Define U 1 (g ••• ®U n 

as that linear transformation of ^ ® • • • (g) ^\' into = 

^i" (g) — ®^n which maps f x ® ••• ®f n into ® ••• ® U n f n 

(see Definition 4.1). Prove that U 1 ® ••• ® U n is a unitary transformation 
of onto ^ . 

*5# Complete Sets of Observables 

5.1. The Concept of a Complete Set of Operators 

The concept of a complete set of observables was first introduced by 
Dirac [1930] and it has remained in extensive use ever since. The 
original formulation is, however, merely heuristic in the general case, 
and it becomes rigorous only in the special case of a set of self-adjoint 



312 


IV* The Axiomatic Structure of Quantum Mechanics 


operators A x A n acting in a separable Hilbert space ^ and having 
pure point spectra (which must be countable since Jf is separable), 

k = l 

Under these circumstances {A 1 A n } is called a complete set of 
operators if the following statements are true for some S C U n : 

(a) To each n-tuple (A x A n ) e S C X ••• X S An can be assigned 

a vector ¥ / Ai . A ^ from the common domain of definition of A x A n 

which satisfies 


(5.1) A k W Xi . An = A^ Ai . An , k = 1, 

(b) The family of all vectors 
(5-2) . An , (A x A w ) g S, 


is an orthonormal basis in Jf. 

Note that the completeness of the set [A 1 A n } implies that the 
operators A x A n have a common dense domain of definition 

containing the linear manifold spanned by all the vectors . A ^ , and 

that they commute with one another on that domain of definition. 

Another fact which has to be borne in mind is that, due to Bose- 
Einstein or Fermi-Dirac statistics, not all vectors (5.2) represent physical 
states, but only those vectors which are appropriately symmetrized or 
antisymmetrized with respect to the indices X ± X n referring to the 
same observables of identical particles of integer or half-integer spin, 
respectively. In the presence of identical particles, the Hilbert space of 
physical states will not be Jf, but rather a closed subspace Jf of 
constructed in accordance with Axiom PI in §4, in which the appro¬ 
priately symmetrized vectors in (5.2) will constitute an orthonormal 
basis. The operators A x ,..., A n will not, in general, leave invariant. 
(Consider, e.g., the spin operators of identical particles in wave 
mechanics.) However, appropriately symmetrized polynomials 
p(A x ,..., A n ) in A x ,..., A n leave if invariant. 

Definition 5.L Let S be a system consisting of a finite number of 
particles, and A 1 ,..., A n obvervables of single particles from that system. 
We say that the function f(x x ,..., x n )> x x ,..., x n e R 1 , is appropriately 
symmetrized with respect to {A ± ,..., A n } if it is symmetric (antisymmetric) 


/(..., x t ,..., x i ,...) = ±/(..., X, ,..., x { ,...) 
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with respect to any two of its variables x t and x i , whenever A i and Aj 
represent the same observable quantity (e.g., energy, spin projection, 
momentum component) of two identical particles of integer (half¬ 
integer) spin. 

Since the set (5.2) is an orthonormal basis in Jf, we can expand any 
vector W e in terms of that basis, 

(5-3) W= £ <^(AjA„) . 

(A x .A n )eS 

(5-4) = .A ,|y>, 

where we have 

(5.5) £ I <f>(K ,•••> ^™)| 2 < +00. 

(A x .A n )eS 

Denote by Z 2 (S), where S C S^i X ••• X S^ n , the space of all functions 
^(A x ,..., X n ) defined on S and satisfying (5.5). It is easy to see that Z 2 (S) is a 
separable Hilbert space and that the one-to-one mapping 

(5.6) W <-> <^»(A X ,..., A n ) 

is a unitary mapping of Ji? onto Z 2 (S). 

On the other hand, the space Z 2 (S) is isomorphic to the space 
L*(U n , fjt) of all properly symmetrized and antisymmetrized (in the case 
where some of the particles in the system are identical) complex functions 
^(A x ,..., X n ) which are square integrable on U n in the measure /x 

(5.7) f | 0(A X A w )| 2 dfx(X 1 A w ) < +oo, 

J u n 

where ja denotes the measure on the Borel sets of lR n , having support 
S, and such that 

(5.8) M{A}) = 1, A = (A x ,..., A w ) g S, 

(5.9) S C x ••• x S^. 

In other words, for an arbitrary Borel set B in IR n , [x(B) is equal to the 
number of the points in the set Bn S. 

The elements of the Hilbert space Z 2 (IR n , /x) are equivalence classes of 
almost everywhere equal functions, i.e., the square-integrable (in the 
measure fx) functions ^i(x) and ^ 2 (^)» x e represent the same element 
of Z a (R w , fx) if 0i(A) = ifi a (A) for all A e S. The wave functions yj(x) are 
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such that if x ± and x 2 are variables corresponding to the same observables 
of two identical particles, then 

0(*"> x x x 2 ,...) = lL0(***> X 2 >•••> X 1 >•••)> 

where the plus sign has to be taken in the case that the particles have 
integer spin, and the minus sign for particles of half-integral spin. Thus, 
Z 2 (IR n , /x) is, in general, a closed linear subspace of the space L 2 (IR n , /x) 
of all functions on U n which are square integrable in the measure /x. In 
the case where all particles are different, L 2 (U n 9 /x) is identical to Z 2 (IR n , /x). 
The mapping 

(5.10) 0(A) <-> 0(*), 0 e / 2 (S), 0 eL 2 (U- 9 m ), 

which is such that 0(A) = 0(A) for all A e S, is a unitary transformation of 
/ 2 (S) onto Z 2 (IR n ,/x). Thus, (5.6) and (5.10) determine a unitary trans¬ 
formation 

(5.11) W<-+<f>(x) = UW 9 WeJf 9 0gL 2 ( R»,/x), 

of onto L 2 (U n 9 /x). 

If we denote, in general, by 3 A the domain of definition of an operator 
A 9 then we have that for W e 3 A 

A k W = A k £ 0(A) = £ W) , 

AeS AeS 

where A fc 0(A) e / 2 (S). Thus, W e 3) A if and only if i/j(x) = UW is from the 
domain of definition 3 A} , of the operator A k acting on L 2 (IR n , /x) as 
follows: 

(5.12) (A k ip)( x ) = **0(*), 0 e 3 Ak ' 9 

3 A} , obviously consists of all 0 eL 2 (IR n , fx) for which x k i/j(x) is square 
integrable, i.e., 

(5.13) ® Ak - = |</>: i/> e L\U n , (i), x h 2 | >/>(x)\ 2 dn(x) < +°o j. 

Moreover, we obviously have 

(5.14) A k ' = UA k U~\ 

The existence of a unitary transformation (5.11) of onto Z 2 (R n , fi) 
which is such that the operators Ai,..., A n ' y defined in (5.14), satisfy 
(5.12) and (5.13) is evidently a necessary and sufficient condition for 
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A x A n to be complete in the sense of the definition given at the 
beginning of this section. 

This approach to the concept of a complete set of self-adjoint operators 
A x A n with discrete spectra leads to a straightforward and natural 
generalization to the case of self-adjoint operators A x A n with 
arbitrary spectra. 

Definition 5*2* The self-adjoint operators A ± ,..., A n , acting in 
the Hilbert space Jf, constitute a complete set of operators in the Hilbert 
space Jf’, which is a subspace of if the following three conditions are 
met: 

(a) There is a measure fi(B) in the Borel sets B of the n-dimensional 

Euclidean space U n with support S C S'* 1 X ••• X S\ where $ An 

are the spectra of A x A n , respectively. 

(b) There is a unitary transformation U of 3$ onto the Hilbert space 
L 2 (IR n , /x), such that the operators 

A k ' = UA k U~\ * = 1 

are the multiplication operators 

(A k 'if;)(x) = x k iP(x), if,e @ Ak ', 

with domains of definition 

®A k ' = | <f>- J *** I '/'Ml 2 d H-( x ) < +°°, t eZ, 2 (R”, m)|- 

(c) U maps onto the subspace £ 2 ((R n , /x) of L 2 (R n , /x) consisting of 
all appropriately symmetrized functions from L 2 (lR n , /x). 

If the first two of the above requirements are fulfilled, the Hilbert space 
L 2 ([R n , fx) is called a spectral representation space of the operators 
A 1 A n , and A/,..., A n ' the spectral representation (or canonical form) 
of the operators A 1 A n . 

If A x is an operator such that {A J is complete (i.e., n = 1 in the above 
definition) then A x is said to be an operator with a simple spectrum. 

5.2. Cyclic Vectors and Complete Sets of Operators 

The significance of the concept of a complete set of operators in 
quantum mechanics is reflected in the following assumption, which is 
ordinarily tacitly made in nonrelativistic quantum mechanics. 

Axiom C* For every quantum mechanical system 6 there is a 
finite set {A 1 A n } of operators in jf*, which is complete in the Hilbert 
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space Jf, C e#, associated with that system, and such that, in the 
absence of superselection rules, every function F(A 1 A n ) which 
leaves invariant and, when restricted to Jf, is a self-adjoint operator 
in Jf, represents an observable of S. 

To give an example of a complete set of operators of the kind 
mentioned in the above axiom, consider a system S of n particles without 
spin. We have seen in the preceding section that if all the particles in the 
system are different, then in wave mechanics we associate with 6 the 
Hilbert space L 2 ((R 3n ). In this case J(? = 2$ = L 2 (R 3n ) and the position 
operators 

(XiWri r w ) = *<0(r x r w ), 

(*V)(rir w ) = r x r w ), 

v, r w ) = *<0(r x r w ), i = 1,..., n, 

constitute a complete set of operators, which in this case represent 
observables. Thus, in this example L 2 (U 3n ) is the spectral representation 
space of the position observables and the above introduced operators 
X i , Y i , Z i , i = 1,..., w, are the canonical forms of these observables. 

In case that some of the particles in the system are identical, we have 
seen in the preceding section that L 2 ((R 3n ) is not the Hilbert space 
associated with S; instead, the space L 2 (R 2n ) of appropriately 
symmetrized Lebesgue square-integrable functions is the space 30? 
associated with 6. In this case, the operators , Y i9 , i = 1,..., n, 
acting in L 2 ((R 3n ) = 3# are still a complete set of operators, but they do 
not represent observables since they do not leave = Z 2 ((R 3n ) invariant. 
However, any operator A = F(X 1 , Y x , Z x ,..., X n , Y n , Z n ) 

(Ai/j)(ri ,..., r w ) = ^(ri ,..., r w ) 0(r x r w ), 

where F(r 1 ,..., r n ) is a real appropriately symmetrized and bounded 
Borel-measurable function, will represent an observable of 6; for 
instance, the projectors (4.11) and (4.20) are such functions. 

It has become customary to refer to any set {A 1 ,..., A n } obeying 
Axiom C as a complete set of observables even when, strictly speaking, 
only their appropriately symmetrized functions are observables. 

Additional examples of complete sets of observables will be 
encountered in §7, when the formalism of wave mechanics will be 
derived from the general quantum mechanical formalism espoused in this 
chapter (see also Sections 4.1-4.4 in Chapter V). 

We shall relate the completeness property of A x ,..., A n to the existence 
of a cyclic vector for A 1 ,..., A n . 
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Definition 5.3. The vector W 0 of the Hilbert space Jf 7 is a cyclic 
vector in Jf 7 with respect to the commuting operators A 1 A n in the 
Hilbert space «#, where D Jf, if the following three conditions are 
fulfilled: 

(a) W Q e& Ai n - n& An . 

(b) For any integers k x k n = 0, 1, 2,..., 

(5.15) A* - A*"P 0 e9 Al n - n 9^. 

(c) The linear manifold 

(5.16) 9{V 0 ; A ,,..., A n ) = {p(A 1 ,..., A n ) 

spanned by all vectors of the ioxmp{A 1 ,..., A n ) , corresponding to all 
choices of appropriately symmetrized polynomials p in A x ,..., A n , is 
everywhere dense in 

In the remainder of this section as well as the next section, we shall 
gradually establish the following main theorem. 

Theorem 5*L The set of self-adjoint operators A 1 ,..., A n acting in 
the separable Hilbert space Jf 7 and with range in $ D Jf 7 is complete if 
and only if there is a vector e ^ which is cyclic with respect to 
A x A n , and A x ,..., A n commute on the dense linear manifold 
^(¥*0 ; A 1 ,..., A n ) generated by applying to all possible polynomial 
forms p(A 1 ,..., A n ) which are appropriately symmetrized. 


To prove that the completeness of A 1 ,..., A n on a separable implies 
the existence of a cyclic vector W 0 for A x ,..., A n we first verify Lemmas 
5.1 and 5.2. 

Lemma 5*L If L 2 (R n , (jl) is unitarily equivalent to a separable 
Hilbert space Jff, then the measure fi is or finite. 

Proof. If jju is not a finite, then there is an uncountable family of 
mutually disjoint measurable sets of nonzero measure. Denote by Xs( x ) 
the characteristic function of the set S. The functions 

1 

V^s) 


e s (x) = 


Xs (x), S 



318 


IV* The Axiomatic Structure of Quantum Mechanics 


constitute an orthonormal system in L 2 (U n , /x), since for S x , S 2 e we 
have 

|0 if S x ^ S 2 

«*(*) «s,(*) = \ 1 ■■ / ^ : f o _ o 

( f S1 ' 

Under a unitary transformation of L 2 (U n , /x) onto Jf 7 , the images of all 
elements from ^ would consitute an uncountable orthonormal system 
in Jf 7 . This is impossible if Ji? is separable. Q.E.D. 

*Lcmma 5*2* Let /x be a or-finite measure and h(x) a continuous 
positive function representing an element of Z, 2 (IR n , /x). The function 

(5.17) f 0 (x) = exp(—a(| x x | + ••• + | * n |» h(x), <x > 0, 
represents an element of L 2 (IR n , /x), and the family of all functions 

(5.18) P(x)f 0 (x) 9 

obtained by letting p(x) range over all polynomials in x eR n is dense 
in L 2 (IR n , /x). 

Proof. Since we have 

(5.19) I ••• exp(— oc(| x t | ••• | x n |) | 

< • K\ 

at *! -1 - 9 


the functions (5.18), which include the function f 0 (x) itself, are square 
integrable because 


I *n r 


^ _ I I _ l "n i _ 

" fl (‘ + tt + ••• + lf Sr) 


I P( x )fo ( x )\ 2 < const I %)l 2 - 

Due to the a finiteness of /x, the family of all continuous functions on 
U n which represent elements from L 2 (IR n , /x) and are of compact support 
is dense in L 2 (M n , /x) (see Exercise 5.1). Hence, it is sufficient to show that 
any continuous function g(x) of compact support and representing an 
element of L 2 (U n , /x) can be approximated arbitrarily well in the norm 


ii/ii 2 =f \m\ 2 d^x) t 

J R n 


by a function of the form (5.18). 
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We carry out the change of variables 

^ > u — p(x) = (w(x i),..., zv(x n )), 

where w( y) is some analytic function on R 1 , mapping R 1 in a one-to-one 
manner into a finite interval I and such that | d n wjdy n | ^ 1 for all 
n — 0,1, 2,... (such a function can be built from exponential functions). 
Under this transformation R™ is mapped onto the w-dimensional rectangle 
R = / X • • X /. 

Since g(x) is of compact support, the continuous function f 0 (x) is 
bounded from below on the support of g(x ), and therefore the function 
g(x)lf 0 (x) is integrable and square integrable on U n . 

The function ^ r [p~ 1 (w)]// 0 [p~ 1 (w)], where p -1 (w) is the inverse of p(x ), is 
continuous and of compact support R. Hence, according to the Weier- 
strass approximation theorem (see Chapter II, Theorem 7.4), we can 
approximate that function arbitrarily well on R by a polynomial, i.e., we 
can find a polynomial 


q(u) = £ a mi . m u' 


for which 


glp-\u)\ 




2 II/ 0 II ’ 


ue R. 


If we denote by the measure in R resulting after the change of variable 
x —► u — p(x), we have 


(5.20) f \g(x)~q(p(x))f 0 (x)\*d h (x) 

J IB 71 


f \fo[p~ 1 (uW 

J R 


g[p-\u)} 




fo[p~»] 


-q(u)\ dp^u) 


4 ‘ 


By expanding each w(x k ) according to the Taylor’s formula to the order 
k, we obtain 


i).»(*.)) = E«m 1 ... mn K*i)] mi - N*»)r n 

= Pk(x) + r k (x). 
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where p k (x) is a polynomial of the order n(k — 1), and r k (x) is the 
remainder. A simple computation shows that this remainder can be 
majorized as follows: 


*(*)! < 


£ k* 


c i 




£ I 




,fc-l 

h 


x *k x *k +1 


where the coefficients c t are constants built from the coefficients of q(u)> 
and can be chosen to be independent of k , due to the requirement that 
| d n wldy n |^1. 

By employing (5.19) we get 

T P I ••• % kn |2 -|l/2 1 r /• -i 1/2 

By taking advantage of the relation 
(5.22) [j^ | Ux) + f 2 (x)\* |/ 0 (*)|« d,4x)] 1/2 

< [J B . l/i(*)l* l/o(*)r M*)] 1 * + [| r „ I/2WI 2 I AM! 2 

which is a direct consequence of the triangle inequality whenever 
/1 -/o . U -fo e ^ 2 (R n , /*), we obtain by using (5.21) 

[J Rn l/o(*)l 2 1 PA*) - q[ P (xW MX)} 1 " 

= [J r „ I /<>(*) r i£ x )\ 2 d i4*)\ 


<r SHo 1 1 ^ i 

^ a n(fc-D ^ 2 

for sufficiently large values of k when a > 1. Combining the above relation 
and (5.20), we arrive (by using the triangle inequality) at the inequality 

f I g( x ) ~ Pk(x)f 0 (x )\ 2 dp(x) < e 2 , 

J re” 

which is the desired result when a > 1. 
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The case with 0 < oc < 1 can be reduced to the preceding case by 
substituting the variables x k by the new variables x k = 2ocx k . Q.E.D. 

As a direct consequence of Lemmas 5.1 and 5.2 we get the following 
theorem upon appropriate symmetrization of f 0 (x) in (5.17). 

Theorem 5*2* If the Hilbert space £(IR n , /x) is separable, it contains 
a vector which is cyclic with respect to the canonical operators ^4/,..., A n ' 
defined in (5.12). 

5.3. The Construction of Spectral Representation Spaces 

We shall prove now that if there is a cyclic vector for the self-adjoint 
operators A 1 ,..., A n , then {A 1 ,..., A n } is a complete set. The proof will 
be obtained by constructing the representation space Z 2 (IR n , /x). 

*Lemma 5 3* Assume that/e M* is a cyclic vector with respect to 
the self-adjoint operators A ± ,..., A n in the separable Hilbert space 
and that /x is the measure satisfying the relation 

(5.23) </1 p(A ,,..., A n )f} = f p(x) drfx), m = </1 E A * . An (B) />, 

for all polynomials p(x). Then there is a unique unitary operator C7, 
mapping L 2 (R n , /x) onto such that 

( 5 - 24 ) U(xi • p) = p(A ,,..., A n ) E a * \I) 

for all appropriately symmetrized polynomials p(x) and all finite intervals 
I in [R n . 

Proof. Note first that every polynomial p{x) is square integrable 
on with respect to the measure /x since 

f n \P(*)\ 2 M*) = f p*(x) p(x) d^x) 

J R n J R n 

- </1 (P(A,A n ))* p(A ,,..., A n )f> 

= \\p(A l9 ...,A n )f\\*<+co. 

Define an operator U 0 from Z 2 (R W , /x) into Jf by 

U 0 (P ‘ xi) = P(A t ,...,A)^. An (I\ 
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where p(x) is any appropriately symmetrized polynomial. The so-defined 
operator U 0 is isometric, 


II U 0 (p • x/)lf = \\P{Ai . A n ) E a ' .'Ho/ll 2 

= <E A ' . (PiA A n ))* p(A x A n ) E A - .^(/)/> 

= <E Al . Hl)f\P*{A r A n ) p(A 1 A n ) E a ' . A »(I)f} 

= f x*( x )(P*( x ) P( x ) xA x )) M x ) = f I P( x ) X/WI 2 M X Y, 

J R n J u n 

in deriving the second step we have made use of the fact that 
P*(A X A n ) p(A 1 A n ) is a polynomial in A t A n . 

The measure /x is finite and, therefore, the family of all continuous 
functions in L 2 (R n , /x) of compact support is dense in L 2 (R n , jjl ) (see 
Exercise 5.1). Since, according to the Weierstrass theorem, each such 
function can be approximated uniformly and arbitrarily well by a 
function of the form Xi( x ) P( x )> where I is a finite interval and p(x) is an 
appropriately symmetrized polynomial, it follows that the family of all 
such functions is dense in£ 2 ((R n , /x). Thus, the domain of definition of the 
operator U 0 can be extended to L 2 (R n , /x) in a unique manner, and the 
operator U obtained in this way is isometric. Moreover, since / is cyclic 
with respect to A 1 ,..., A n , the family of all vectors p(A 1 ,..., A n )f is 
dense in and, therefore, the family of all vectors 

^. An {I)P(^ — A n )f, 

is also dense in M". Hence, the range of U 0 is dense in Jf, i.e., the range 
of U has to be Thus, U is unitary. Q.E.D. 

Note that if we take a sequence C / 2 C • • • of intervals which are 
such that ^U/jU • • • = [R n , then Xi (^) P( x ) converges strongly to p(x) 

and E A * . A «{I)p{A 1 A n )f converges strongly to p(A 1 A n )f. 

Since 


U(p-x h ) = E A1 A "(I h )p(A 1 A n )f 

and U is bounded and therefore continuous, we get in the limit 

Up = p(A x A n )f ; 

By using Lemma 5.3 we can easily prove Theorem 5.3. 

Theorem 53* Let /e«# be a cyclic vector with respect to the 
self-adjoint operators A 1 ,..., A n in the Hilbert space let /x be the 
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finite measure defined in (5.23) and U the unitary operator mapping 
L 2 ((R n , [jl) onto e# and satisfying (5.24). Then the self-adjoint operators 

U~ x A k U, k = 

in L 2 (U n y /a) are the (/x-almost everywhere) multiplication operators 

= x k ifj(x k ), if> e 2 V _i AkU , 

and therefore for any of the operators A in Theorem 2.6, 

( U-'A U*)(x) = F(x) ft*), 0 e 3 v - lAU • 


Proof. We shall prove the theorem by showing that ZJ^AjJJ 
coincides on functions of the form Xi( x ) P( x ) (J * s a finite interval) with 
the operator A k ', 


(A k i/j)(x) = x k ftx), $ e @ Ak ', 


with domain 


®A k ’ = | rM x * 2 I H x )\ ad K x ) < +°°|- 

Since x k p(x) is a polynomial when p(x) is a polynomial, we have 

[U~ x A k U(p • *,)](*) = [U-'AbPiA,A n ) E a i A «(I)](x) 

= x kP(x) Xi( x )- 

Now, ZJ- 1 A k U is self-adjoint because A k is self-adjoint and U is unitary, 
while A k is known to be self-adjoint. Thus, the operators U^AjfU and 
A k must be identical because they coincide on the set of all functions 
h(x) = Xi{ x )P{ x )> which is dense inL 2 ([R n , /x). In fact, since ([/“M^C/)* = 
U~ x A k U and p is any vector in 2u-i AkU , we have 

(u-'AjcUifj | hy = (i/j | u- x A k uhy = f 0*(*) x k h(x) d^(x) 

= f (Xk>P( x ))* K x ) dfJ.(x) 

j R n 

for all h(x) = p(x) Xi{ x )> which implies that (C/ -1 ^4 fc C/0)(^) = x k ift(x); 
the representation of U^AU follows in a similar manner. Q.E.D. 
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It is evident that Theorems 5.2 and 5.3 imply Theorem 5.1. 

It should be noted that a complete set of operators can have many 
spectral representation spaces corresponding to different measures /x. 
For instance, if we take the case of the single operator X inL 2 ([R 1 ), 

(Xi/j)(x) — xi/j(x), 

which has a simple spectrum, then e ~ x2 is an example of a cyclic vector 
for this operator, since the family of all functions p(x) e ~ x2 / 2 [where p(x) 
is a polynomial] is dense in L^R 1 ) (see Chapter II, Exercise 7.9). For a 
polynomial p(x ), x e [R 1 , we shall have 


< e~ x ‘ 2 | p(X) e~ x *y = f p(x) e~ 2x 2 dx = f p(x) dfi(x) 9 
J R 1 J R 1 


where 


dfx(x) = e 2x2 dx. 


Thus, L 2 ((R 1 , /x) will be another spectral representation space of X. 


5.4. Cyclicity and Maximality 

We have seen in the previous section that in the case that n identical 
particles are present, it can happen that the self-adjoint operators 
A 1 ,..., A n corresponding to observables of individual particles will not, 
in general, represent observables of the entire n-particle system. Instead, 
appropriately symmetrized functions of A 1 ,..., A n will represent 
observables, provided we are working in the Schroedinger picture. 

A special case of such functions will be the projectors E^ An (B) 

corresponding to Borel sets B whose characteristic functions Xb( x ) 
are appropriately symmetrized. 

It is therefore convenient to express the property of / 0 being cyclic 
in terms of projection operators. The next two theorems are related 
to this problem, and lead to the important Theorem 5.6. 

Theorem 5*4* If/ 0 is a cyclic vector in $ with respect to the self- 
adjoint commuting operators A x ,..., A n in «#, then the linear manifold 

(5.25) . A %:XeR n ) 

spanned by all vectors E Al, -”’ An f 0 , A e IR n , is dense in 3$. 
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Proof. Assume that the set (5.25) is not dense in 3$. Then there 
exists a nonzero vector g which is orthogonal to all the vectors in this 
set, i.e., 


<g\Ef' . A %y^0, AeR”. 

Using Theorem 2.7 [see also (2.28)], we can write for any polynomial 
P(AiA n ) 


<g I p(A ,,..., A n )f 0 y = f p( A) dig I e£ . An f 0 y. 

J M n 

Then the identity <g \ Ef 1,m " ,An f o y = 0 implies that 

<g\p(A l 9 ... 9 A n )f o y^O 

for any polynomial p(A x A n ), which contradicts the assumption that 
/ 0 is cyclic with respect to A x ,..., A n , i.e., that the set of all vectors 
p(A 1 ,..., A n )f 0 , corresponding to all polynomials p( A), is dense in 3$. 

Q.E.D 

The following result represents a converse, in a limited sense, of the 
above theorem. 

^Theorem 5*5* If g 0 is a vector in the Hilbert space 3>P which is 
cyclic with respect to {E Al ''"' An : A e R n } so that 

(5.26) {Efr . An g 0 : A e U n ) 

is dense in «#, then any vector / 0 of the form 

(5.27) / 0 = exp[ a(| A x | + ••• + | A n |)] g 0 , a > 0, 

is a cyclic vector with respect to the self-adjoint commuting operators 
A\ >•••> A n . 

Proof. The vector / 0 in (5.27) is in the domain of definition of any 

polynomial p(A x ,..., ^4 n ), since,..., A n ) exp[—a(| X x \ + -b I K l)]> 

Aj A n G IR 1 , is a bounded function and therefore 

f l/KAiA n ) exp[-«(| Aj I + ••• + I A„ |)]| 2 d\\ Ef 1 ^J 2 < +oo. 

J u n 

rf / () were not cyclic with respect to A x ,..., A n , then the linear manifold 

(5.28) (E* * exp[—a(| A t \ + - + \ A n \)]g 0 : Xe R”) 
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would not be dense in Jf 5 , i.e., there would be a nonzero vector 
/orthogonal to (5.28), 

</l . An exp[—a(| A x | + - + | A n |)]* 0 > = 0, Ag R« 

This implies that 

(5.29) f p( A) exp[—a(| A, | + - + | A w |)] d(f \ .^ 0 > 

J u n 

= f jW *<f I ^ ^ ex P[~" a (l A x | + - + \A n |)] £ 0 > = 0. 

J M n 

Since the measure 1 


(5.30) fi(B) = </ 1 E*' . \B) B g «», 

is finite, we can take in Lemma 5.2 h( A) = 1, and conclude that the set 
of all functions 

(5.31) p(A) exp[—a(| | + ••• + | A„ |)] 


is dense ini 2 (R w , fi). Thus, due to the fact that any characteristic function 
X#(A) of a Borel set B is square integrable in the measure (5.30), it 
can be approximated arbitrarily well in the mean in the measure (5.30) 
by a function of the form (5.31). By taking such an approximation to the 
limit and using (5.29) we obtain 

f n XBW d <f\ Et . An go> = </l E 4 * . An (B)go> = 0. 

J u 

This implies that/is orthogonal to the set (5.26). But this is impossible if 
/ ^ 0 and (5.26) is dense in Q.E.D. 

In relation to the most general case encompassed in Definition 5.3, 
Theorems 5.4 and 5.5 refer to the case when 30* = 2$. The formulation 
of the equivalent of these theorems in the more general case when 
is a nontrivial closed subspace of $ can be achieved by replacing 3$ by 
in the above two theorems, and substituting the vectors in (5.25) and 
(5.26) by their projections onto Jf. This means, de facto , replacing the 
projector-valued function in the A x ,..., A n g IR 1 by an appro¬ 

priately symmetrized operator-valued function 
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where the above sum is carried out only over those permutations of 
(A x A n ) which permute the same observable of identical particles. The 
alterations in the proofs of Theorems 5.4 and 5.5, which are necessary 
when we are dealing with this more general case, are very easy to carry 
out; they are left to the reader. 

The following theorem states the important maximality property 
of the algebra generated by a complete set of observables. 

Theorem 5*6* If the bounded operator A> defined on the Hilbert 
space commutes with the operators A 1 ,..., A n , where {A x ,..., A n } 
is a complete set of operators, then A is a function of A 1 ,..., A n . 

Proof. Since {A x ,..., A n } is a complete set, there is at least one vector 
/ 0 g / which is cyclic with respect to A 1 ,..., A n . According to Theorem 
5.4, the linear manifold (5.26) is dense in 3$. Consequently, there is a 
sequence/i ,/ 2 ,... 



which converges strongly to Af 0 . If we introduce the function 


where 


F m ( A) = a® 0(A - a£>) + ... + a ye(\ - *£■>), 



for A < 0 
for A > 0, 


then we observe that/ m = F m (A 1 ,..., A n )f 0 . Since 


II/. ~fm II 2 = f I F t ( A) - F m (A)|* d || . A % || 2 

and/ x ,/ 2 ,... converges, we deduce that /^(A), F 2 (X),... is a sequence of 
Borel measurable functions which is fundamental in the mean with 
respect to the measure (jl 0 (B) = || E A ^ ,m - ,An (B)f 0 || 2 , Be£8 n . According 
to the Riesz-Fischer theorem (Chapter II, Theorem 4.4), /^(A), F 2 (A),... 
converges in the mean, with respect to the measure /x 0 , to a /x 0 -square- 
integrable function F( A). Hence, by using Theorem 2.5, we arrive at the 
conclusion that 


(5.32) Af 0 =F(A l9 ...,A n )f 0 . 

Since A commutes with it commutes with any operator 

D = . A « H - 1- a m E^ . A \ 
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It is easy to check that Df 0 e & F u lt ...,A n ) > and 

ADf 0 = DAf 0 = DF(A 1 ,..., A n )f 0 = 2^,..., i4 tt ) Df 0 . 

Thus, A and F(A 1 ^l n ) are operators which coincide on the set of all 
vectors Df 0 , i.e., on the dense linear manifold (5.26). Now, A is bounded 
and 2f A = ^ while F(A 1 ,..., A n ) is closed, since by Theorem 2.6, 
F(A 1 ,..., A n ) also has to be the adjoint of the densely defined operator 
F*(A 1 ,..., A n ). Hence, we conclude* that A — F(A 1 ,..., A n ). Q.E.D. 

Exercises 

5*1* Prove that if /x is a cr-finite measure, then the family of all 
infinitely many-times differentiable functions which are of compact 
support and belonging to Z 2 (R W , /x) is dense in Z 2 (R n , /x). 

5.2. Use Lemma 5.2 to deduce that the set of all functions p(x) e~^ x \ 
P > 0, where p(x) are polynomials in x e 1R 1 , is dense in L 2 ((R 1 ). 

5.3. Let {^lj 1) ,..., -4^},..., {^4i m) ,..., A { ^} be complete sets of obser¬ 
vables of the systems ,..., 6 n , respectively, and let these operators act 
in the spaces ,..., (which have as subspaces the Hilbert spaces 

>•••> associated with ,..., 6 m ). Denote by Aj. u) the self-adjoint 
operator with the spectral measure 1 (x) ••• (x) Ej. v) (B) ® ••• (x) 1 [see 
(4.3)], where E^\B) is the spectral measure of A { r v) . Show that 
{^lj 1) ,..., A J^} is a complete set of observables of the 

system 6 — {Si ,..., S m }. 

5.4. Show by making use of the definition of a spectral measure, 
that if [£ ,(1 ) (jB 1 ), E (2) (B 2 )] = 0, the set S of all vectors / satisfying 
E (l) (B l )E (2) (B 2 )f = /, where jB x , B 2 are any bounded Borel sets in R 1 , 
is dense in If ^ and A 2 are the self-adjoint operators corresponding 
to E a) (B) and E (2) (B) y respectively, prove that S C @ Ai a 2 n ^ 2/ll and 
that A ± A 2 f — A 2 A x f for all / eS. 

5.5. Show that if is any self-adjoint operator and E(B ), B e^? 1 , 

is its spectral measure, then for any bounded Borel set R we have 
®ae(r) = ^ , and AE(R)f == E(R)Af for all/e ^ , i.e., 

AE(R) is the extension of E(R)A to the entire Hilbert space. 


* In other words, A necessarily belongs to the Abelian algebra generated by A x A n , 
and therefore {A x A n } is complete if and only if that algebra is maximally Abelian. 
This observation is especially significant when dealing with infinite sets of commuting 
observables (de Dormale and Gautrin [1975]). 
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6♦ Canonical Commutation Relations 

6.1. The Empirical Significance 
of Commutation Relations 

One of the most striking features of the position and momentum 
observables defined in (1.1)—(1.6) (where, in general, we can take 
i — 1,..., n) is that all position operators as well as all momentum 
operators commute among themselves, while this is not true for each pair 
consisting of a position operator and a momentum operator. As a matter 
of fact, it is very easy to see that a position operator corresponding to the 
position coordinate of a certain particle in the system commutes with any 
momentum operator except the one belonging to the same momentum 
component of the same particle; for instance, Q commutes with P\ y) 
and Pf\ i — 1,..., n, as well as with Pj** for i = 2,..., n, but we have 

(6.1) (Qt ] P^ - pFqFW = iH, <P e n % i( *) 0i (*). 

This can be easily verified by taking for i fs differentiable functions: 

^ ih ^ *AC*T >•••> x n) ^ ^ ^ x w )) = ihi/j(x 1 x n ). 

The commutation relations satisfied by the position and momentum 
operators in wave mechanics are called canonical commutation relations. 
In general, given 2 n self-adjoint operators ,•••> £?n > Pi>—>P n a 
Hilbert space Jf, we say that they satisfy canonical commutation relations 
in which Q k is canonically conjugate to P k , k — 1,..., w, if 

(6.2) ( QjcPjc ~ PkQk)f = iM, k = 1,..., n, 
for any / belonging to the domain of definition of 

[Qk > Pk] — QkPk — PkQk j 

while any other two operators from {Q 1 ,...,0 n , P 1 P n } commute. 

In the first section of this chapter we identified commutativity of 
operators representing observables with compatibility of those 
observables. The empirical meaning of compatibility was given in 
Definition 1.1. From the discussion in §1 we must draw the conclusion 
that canonically conjugate observables are incompatible, i.e., that there 
is reason to believe that there is a lower limit to the accuracy with which 
simultaneous measurements of such observables can be carried out on the 
same system. Such is the case with canonically conjugate position and 
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momentum observables, as revealed by an extensive analysis of different 
“ gedanken ” experiments (see Messiah [1962, Vol. I]). 

It is important to realize that the commutation relations satisfied by 
canonically conjugate observables also imply that the outcomes of many 
measurements of those observables on systems in the same state possess 
certain statistical features. This conclusion can be drawn from the 
following result. 

Lemma 6*1* If / is a vector in the domains of definition of the 
operators A 2 y [A t , A 2 ], and A 2 2 , where A ± and A 2 are self-adjoint 
operators in Jff, then 

(6.3) </1 AtfXf I A 2 /> > il</1 \A X , A 2 ]f >|». 

Proof. Since for any real value of A 

</1 A 2 /> - Kf I i[A x , i*J/> + A \f | Atf} 

- </1 [A 2 -&(iM a - AA 1 ) + AM 2 *]/> 

= </l (A 1 +zXA 2 )(A 1 -i\A 2 )f} 

= ((A 1 —iAA 2 )f\ (A 1 —iXA 2 )fy > 0, 

we deduce that Im </1 i \A X , ^4 2 ]/> = 0, and the discriminant of the 
above polynomial of second degree in A has to be nonpositive. Thus, we 
have 

l</1 [A , A 2 ]fy |» - 4</1 AtfXf I A 2 f> < 0, 

and (6.3) follows immediately. Q.E.D. 

Let us take a normalized state vector W from the domain of Q k 2 y P k 2 , 
and [Q k , P fc J. Since 

[A,, A 2 W = ihW, 

where 

A 1 =Q i -<'F\ Q k vy, A 2 = P k — <w \ P k wy , 
we get, by applying Lemma 6.1, 

(6.4) <T I (Q k - (W I Q k wyfwy<w I (P k - (w I p k wyfwy > £ 2 /4. 

The quantities cr y (J3 fc ) and oy(P k ), where 

°v\Q k ) = f (A - <y | Qt'Py? d II E°"P II 2 = I (Q k - (V I Q k W))W), 

J u 1 

(6.5) 

°Ap*) = f (a - <y i pjpy? d ii Ep? ii 2 = <y i (p r - i wm 

j R i 
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are the standard deviations of the probability measures || E Qk (B) W || 2 and 
|| E P *(B) ^H 2 , respectively. Thus, inequality (6.4), which can also be 
written in the form 

a w{Qk) °w(Bk) ^ ^/2, 

indicates that we cannot find states for which the standard deviations 
of canonically conjugate position and momentum are both arbitrarily 
small. 

The standard deviations o w (Q^) and cr w (P k ) can be obtained experi¬ 
mentally by preparing an ensemble of systems in the same state, and then 
carrying out at the same instant, when the systems are in a state 
represented by W, a measurement of Q k on the systems in one part of the 
ensemble, and a measurement of P k on the remaining systems in the 
ensemble. This procedure, which is implicit in the theory of mea¬ 
surement of §1, enables us to compute || E Qk (B) W \\ 2 and || E Pk (B) ^H 2 , 
from which cr^^) and o w (P k ) can be computed. 

6.2. Representations of Canonical 
Commutation Relations 

If Q 1 ,..., Q n , Pj ,..., P n are self-adjoint operators satisfying canonical 
commutation relations in which Q k and P k are canonical conjugate pairs, 
we write, symbolically, 

(6-6) [Qo , Q k ] = [P, , P fc ] = 0, [Q 5 , P k ] = iM jk , j,k = 1,..., n . 

Any set of 2 n operators satisfying the above relations is called a 
representation of the canonical commutation relations. 

If U is a unitary operator, it is easy to see that the self-adjoint operators 

(6.7) QS = UQ t U-\ P/ = UP,U-\ j = 1,..., n, 

constitute another representation of the canonical commutation relation 
(6.6). In fact, the spectral measures of Q k and P k are 

E°*\B) = UE q \B) U~\ E P *'(B) = UE P \B) U~\ 

and it is therefore easily verified that 

[E° l (B 1 ), E p *Xb 2 )] = U[E°*(BJ, E P \B 2 )\ C/-1 = 0, k, 

and that Qi,..., Q n ’, as well as P/,..., P n ', commute among themselves. 
Moreover, for/e^o/v-Vo/ = vs> Q 1 p 1 -p i Q i we have 

[Q/, P/]f = UIQ, , PA u-y = ihuu-y = ihf. 
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Two representations {< Q x P n } and {< Q /,..., P n '} of (6.6) which satisfy 
(6.7) are said to be unitarily equivalent. The question arises whether all 
representations of (6.6) are unitarily equivalent. The answer to this 
question has physical implications, since unitarily equivalent representa¬ 
tions are physically indistinguishable. As a matter of fact, assume that 
there are two quantum mechanical theories and describing the 
same system. If the observables of the two theories are related in a 
one-to-one fashion, 


A <-> A\ ieJ, A'ef, 

and there is a unitary transformation of onto such that 

A' = UAU~\ 

then, by setting W = UW, we map all the states of one theory onto the 
set of all states of the other theory in a one-to-one fashion. The only 
physically important quantities, namely the expectation values of 
bounded observables, 

(A')r' = <W\ A'wy = <JJW\ UAU-'UWy 
= <V\AVy = <A > V9 

are the same in both theories, and therefore and IF* are physically 
indistinguishable. 

From the mathematical point of view, the question of whether all 
representations of (6.6) are unitarily equivalent is a version of a 
uniqueness problem, which consists in finding out the manner in which 
all solutions of (6.6) are related. We already know that there are solutions 
to the problem, one of them being the Schroedinger representation 

(£?fc*AX‘*'l j***j x n) ~ x Jc 0(*i j***j ^w)? 

(6 ’ 8) a 

(Pfc<A)(*l vj x n) = lh */*(#i ,•••? x n )j 

in which = L 2 ((R n ). 

Due to difficulties with the domains of definition of unbounded 
operators, which do not coincide with the entire Hilbert space, the 
solution of the problem posed above becomes feasible only after the 
the problem is rephrased in a technically different fashion. This is 
achieved by the following heuristic procedure. 
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Assume that P and Q is a canonically conjugate pair 

(6.9) [Q,P]=ih. 

If P and Q are treated like bounded operators and (6.9) like a relation 
valid on the entire Hilbert space, one obtains by induction (see Exercise 
6.2, and for rigorous results see Putnam [1967]), 

(6.10) [Q,P n ] = ihnPn- 1 . 

Setting, formally (see Exercises 3.5 and 3.6), 


e iPu 


00 


= £ 


n =0 


(iPu) n 

n\ 


U G R 1 , 


and using (6.10), one derives in the same vein 


00 (iu\ n 

[Q,* iPu ] = Z -[0,P*] = -*« ^ 

n =0 71 ' 

Multiplication of the above relation from the left by e~ iuP yields 

e~ iPu Q e iPu = Q — hu. 

By taking the nth power of the above equation, one obtains 

e -iPuQn e iPu = (g _ 


Consequently, for v e U 1 


0 —iPu giQv giPu 


00 

= £ e~ iPu 

w=0 


{ivQY 

n\ 


e iPu 


_ * (iQv - ihuv Y ivQ _ itiuv 

~ L n \ ~ e 

n =0 

Finally, multiplying the above relation from the left by e iPu , one gets 
the Weyl relation , 

(6.11) e iQv e iPu = e~ ihuv e iPu e iQv , u,ve R 1 . 

The above relation involves only bounded operators defined on the 
entire Hilbert space. It is advantageous to adopt as a starting point in our 
study of representations of canonical commutation relations the Weyl 
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relation (6.11), which has a precise meaning, instead of the symbolic 
relation (6.9). It will be shown that commutation relations, such as 
[Qi > Q 2 I = 0, can also be replaced by relations 

e i Q m e w 2 v = e w 2 v e i Ql u^ v e R i # 

involving only bounded operators. Such considerations necessitate an 
understanding of the basic properties of operator families e iiA , f e R 1 , 
where A = A*. 

6.3. One-Parameter Abelian Groups of Unitary Operators 
T he basic properties of the family 

(6.12) U t = e iAt , fell 1 , 

are given in Theorem 3.1. It follows immediately from this theorem that 
U 0 = 1, C/7 1 = U t * = U _ t , and U t U h = U t U %i . These three 

features of the operator family U t , t e JR 1 , characterize it as an Abelian 
group of unitary operators. Moreover, this family is weakly continuous 
in £, i.e. (see Exercise 6.3) 

(6.13) w-lim U t = U h . 

It is a significant result, which constitutes one of the fundamental 
theorems of functional analysis, called Stone’s theorem, that every 
continuous one-parameter group of operators is of the form (6.12). The 
key word in this statement is “continuous.” The following lemma 
contributes to the understanding of continuity in functions U(t), t e U\ 
where U(t) are unitary operators. 

Lemma 6*2* If / = w-lim n ^/ n and \\f n || < ||/||, then 

/ = s-lim^oo f n . 

Proof. Since </1 /»> -> </1 / > = || /1| 2 , we have 

11/ ~fn II 2 = ll/ll 2 - 2 Re</ | f n y + || f n || 2 
= - ll/ll 2 + € n + ||/ n IP, 

where e n -> 0 when n -> + 00 . On the other hand, ||/ n || 2 — ||/|| 2 < 0, 
so that 


0<||/-/nl! 2 <€ n . 

Thus, ||/ — f n || -> 0 when n -> + 00 . Q.E.D. 
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Assume that U(t) are unitary operators for all t e IR 1 , and 

U(t 0 ) = w-lim U(t), 

t~*t 0 

i.e., [/(£)/converges weakly to U(t 0 )f for any/e Since 
II U(t)f\\ = ll/ll = || U(t 0 )f\\, 

we can apply the above lemma and infer that U(t)f converges strongly to 
U(t 0 )f for any fe i.e., 


U(t 0 ) = s-lim U(t). 

t-*t 0 

Thus, in the case of one-parameter families of unitary operators, weak 
continuity implies strong continuity. 

^Theorem 6*1 ( Stone's theorem). To every weakly continuous, 

one-parameter family U(t), t e R 1 , of unitary operators on a Hilbert space 
Jf, obeying 


U{ti + * 2 ) = U(t 2 ), t ly t 2 e R 1 , 


corresponds a unique self-adjoint operator 


such that U(t) = e iAt for all t e R 1 . 

Proof. We shall prove the theorem by building the spectral function 
E a a of A from the operators U(t), t e R 1 . 

Consider first the special case of families U x {t) which are periodic in t , 
with the period In, i.e., U x (t + 2tt) = U x (t). We shall prove that there 
is a family of mutually orthogonal projectors E n , n = 0, ±1, ±2,... 
with a sum equal to the unit operator, 


(6.14) 


+00 

z 


E n = s-lim 

n r-*+00 


I E„= 1, 


such that Ux(t) can be expanded in the following series: 

4.00 +r 

Ujit) = £ e^E n = s-lim £ e™E n . 


(6.15) 
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It is easy to verify that 

r 2n 

(J\g) n = (ll2n) e-*-\f\U 1 (t)g}dt 
J 0 

are bounded bilinear forms in/and g. It follows from Riesz’ theorem (see 
Chapter III, Exercise 2.5) that there is a unique bounded linear operator 
satisfying < /1 E n g > = (/1 g) n for all f,geJf?. In the symbolic notation 

/.2rr 

(6.16) £ n = (l/27r)| e-^U^dt 

J o 

it becomes obvious that E n would be the Fourier coefficient of U x (t) if 
U x (t) were an ordinary function. 

The operator E n is self-adjoint, since 

<f\E n *g> = <E n f\g>=<g\E n f}* 

/»2tt 

= (1/2*) f e^{g | U x (t)f}* dt 
J 0 
/.2rr 

— (1/2*) f e^Xf\ U x *{t)g)dt 

J 0 

p2n 

= (1/20 f e in Kf\ U x {-i)g) dt 
J 0 

2 IT 

= —(1/2*) f «-*"*«</ 1 U x (t x ) g y dt x 
J o 

= (1/2*) f° e~ int Xf I U x (t x ) g y dt x 

J -2ir 
(•2 IT 

- (1/2*) f *-*•“.</ I u x (t 3 ) g y dt 2 = </1 E ng y, 

J 0 


where the last step was carried out by setting * 2 = h + and taking 
into account that + 2n) = U^). Moreover, using again the 

periodicity of U x (t ), we obtain (cf. Chapter V, Theorem 3.4) 

(•2 IT 

U x (t) E n = (1/2*) f e~^U x (t + t x ) dt x 
J 0 

/•t-\-2lT 

= (1/2*) e int er***tU x (t a ) dt 2 

J t 
»2it 

= (1/2*) e int e~ int »U x (t 3 ) dt 3 ■= e int E n , 

J 0 


(6.17) 
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and consequently 

/*2tt /»2 it 

E m E n = (1/2^7) f e-^U^t) E n dt = (1/277) j e-«™~*»E n dt 
j 0 J 0 

_ (E n for m = n 

(0 for m n. 


Thus, we have established that E 0 , E ±1 ,... are mutually orthogonal 
projectors. Hence, it follows from Theorem 3.9 in Chapter III that 


E = 



is also a projector. 

It is obvious that 1 — E is orthogonal to all projectors E n . 
Consequently 

/.2 7T 

(6.18) (1/277) f e-^Kf I £4(0(1 - E)g> dt 

J 0 

= <f\E n (l -E)g> = 0, n = 0,±1,... 

for all f.geJrif. Since </1 —■ E)g} is a continuous periodic 

function in t , of period 27r, and has vanishing Fourier coefficients (6.18) 
on the interval [0, 27r], it follows that </ | — E)g ) = 0 for all 

t eU 1 and all /, £ eJ#*. Hence, £/i(0)(l — 2?) = (1 — E) = 0, i.e., 
E = 1. Thus, (6.14) is established. 

Multplying (6.14) from the left by U(t) and using (6.17), we obtain 
(see also Chapter III, Exercise 5.6) 

U t (t) = s-lim £ U&) E n = s-lim £ e™*E n . 

tl~* 00 w 

n=-r n=-r 

This establishes that (6.15) is true when U(t) is periodic in t. 

Let us now turn our attention to the general case, when U(t) is not 
periodic. Introduce the one-parameter family 

( 6 . 19 ) U&) = U(t)(U(2iT))-W” 9 

where the power in t is defined in terms of the spectral decomposition of 
the unitary operator U(2tt) [see (6.5) of Chapter III], 
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by the expression 

1% 2tt 

(U(2n)Y = f e iu d£ h . 

J 0 

We obviously have U^Itt) = 1. Since U(t) commutes with U(27t), 
U(t) U(2 tt) = U(t + 2tt) = U(2tt + t)= U(2ar)U(t) 9 
it also commutes with , and consequently with (U(27t))~^ 2it . Hence 

TO TO = TO TO(TO)-^TO)" V2 " = Ul{h + t 2 ). 

In addition, when t -> t 0 , the weak convergence of U(t) to U(t 0 ) and the 
uniform convergence of (U(27t))~^ 2it to {U{2tt))~^I 271 (see Exercise 6.4) 
imply that 

TO = w-lim TO 

t 

The above results establish that U x (t) is a periodic weakly continuous 
one-parameter group, with period 277, which can be therefore expanded 
in the series (6.15). Consequently, for any f^ge^ 

(6.20) </1 U(t) g y = </1 TOTO)* /27 *> 

= (/ (s-lim £ e™E n ) (U(2ir)yn*g\ 

\ \r~* oo n= _ r / / 

+00 

= £ e^(J\E n {U{2ir)y^g) 

n=—oo 

+ 00 /*2tt 

= E e^(ll2n) e i »‘^d(E n f\£ Ag y 

n =—oo 0 

+ 00 

= E f «“-+*!>* | £ Ai? > 

n =—°o 0 

+ 00 -n+1 

= E f «"»*«/</1 ^Arnrf). 

n—oo J w 

Introduce the spectral function (see Exercise 6.5) 

N(X) -1 

— X + En(\)£\-N(\) 9 _ 


( 6 . 21 ) 
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where N(X) is the greatest integer satisfying iV(A) < A. In terms of Ef, 
(6.20) becomes 

(6-22) </1 U{t)gy = f + “ e^Kf | Efg). 

J —00 

Furthermore, E A is uniquely determined by U(t). In fact, if I is any 
finite closed interval, its characteristic function is the pointwise limit of a 
uniformly bounded sequence of generalized trigonometric polynomials 
(see Exercise 6.6) 


PniX) = £ a nk e ^ iln)kx . 

k=-r n 

Setting 

Pn[U(t)]= £ a nk (U(t)Y^ 

k=-r n 


and using (6.22) to obtain 

I </1 (Pn[m\ - £W/>I < r I P«W - X/(A)| 2 d(J I Eif y, 

j —00 

we conclude with the help of Lemma 3.1 that the above expression 
approaches zero when n -> + oo, and consequently 

E A (I) - s-lim p n [U(t)]. 


Thus, U(t) determines E A (I) completely, and in turn E A (I) determines 
(see Chapter III, Theorem 5.6) E A (B) for any Be#. Q.E.D. 

6.4. Representations of Weyl Relations 

We are now ready to pose the problem of representations of the 
commutation relations (6.6) in terms of one-parameter continuous 
families e iQk \ e iPkt , k = 1,..., n. This reformulation of (6.6) is made 
possible by Theorems 6.2 and 6.3, which will be proved next. 

Theorem 6*2* Two self-adjoint operators A x and A 2 in the Hilbert 
space commute (in the sense of Definition 1.2) if and only if 

(6.23) 

for all t ly t 2 s R 1 . 


e iA z t * = e iA ^ 



340 


IV. The Axiomatic Structure of Quantum Mechanics 


Proof. According to Definition 1.2, A x and A 2 are said to commute if 
and only if 

(6.24) [E A i(B 1 ),E A *(B 2 )]=0, 
for all B 1 ,B 2 e& 1 . 

If (6.24) is true, we can apply Theorem 2.4 to infer (6.23): 

^A-^b giA^b — gi(A^-\-A^)b — gi(A^-\-A^)b — giA^b giA^b 

Conversely, assume that (6.23) is true. To prove that (6.24) is then 
also true, we resort to the construction of E A i and E A * carried out in the 
proof of Stone’s theorem. 

First, we observe that 

[U^Xh), U?Xt 2 )\ =0, fi.f.eR 1 , 

where Ui k \t) is defined by (6.19) in terms of U ik) (t) = e iAkt , k = 1,2. 
This implies that all the corresponding projectors E (k \ n = 0, ±1,..., 
in (6.16) and the spectral functions £ (k) of U {k) (27r) commute with each 
other for k = 1, 2. From this we deduce that the spectral functions E ^ 
and E A * commute. Q.E.D. 

We have seen that (6.11) can be derived from (6.9) only in a heuristic 
manner (but see also Putnam [1967], §§4.6-4.11). However, (6.9) is a 
rigorous consequence of (6.11), as stated in the following theorem. 

Theorem 63* If A x and A 2 are two self-adjoint operators in ^ 
satisfying 

(6.25) e iA ^ = e iA ^ e iA ^i 

for all t lf t 2 e R 1 , then 

(6.26) A 1 AJ = if+A i A 1 f 

for any fe 9 AiAt - AtAl = 9 AyA% n 9 A%Ax . 

Proof. Using Theorem 3.1, Exercise 5.6 in Chapter III, and the 
fact that e iAltl is a bounded operator, we derive from (6.25) 

e iA i t i(iA 2 f) = s-lim — — 1)/ 

f 2 -»0 t 2 

— fe — 1 - I e iA i*i f + s-lim — (e iA zh — 1) e iA i*i f 
dt 2 lt 2 = 0 ** ) J 

= — it x e iA i*i f + iA 2 e iA ihf. " 
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Since because / e ®a x a % > we can again employ (3.6) to deduce 

(note that the existence of the third limit follows from that of the 
preceding two limits!): 


~AAf = s-lim 1 («“.*. - 1 )(iA 2 f) 

= -s-lim — iu f + s-lim(^ 2 ) — (e iA i*i — 1)/ 

*i^o t x 1 J t x ->0 v t x v 

= -if + s-lim(/^ 2 ) j- (e iA ih - 1)/. 


The above relation implies that for all g e 2 Az 

(6.27) <£ | A^J} = Kg |/> + \im(A 2 g | (-i/t^h - 1 )/> 

= Kg I/) + (A % g | A-yfy = I/) + (g I A 2 A 1 fy, 

where the last step is valid due to the assumption that / e @a 2 a x > i* e *> 
A ± f e 2 A . Since g assumes all values in 2 A , which has to be dense in 
because A 2 * exists, we infer that (6.26) follows from (6.27). Q.E.D. 

The above two theorems indicate that the commutation relations (6.6) 
are equivalent to the following set of Weyl relations: 

[exp (iPjUj), exp (iP k u k )\ = (exp(/0^), exp (iQ k v k )\ = 0, 

(6.28) exp(*7>,) exp (iQ k v k ) = exp^^ii^) exp (iQ k v k ) exp (iP jUj ), 

“l U n , V X ,..., V n G OS 1 . 

However, strictly speaking, we know only that (6.6) follows from (6.28), 
where the precise meaning of [| Q k , P k ] = iH is given by (6.2). In any case, 
whenever we say from now on that Q ± ,..., Q n , P x ,..., P n are a represen¬ 
tation of the canonical commutation relations we mean, according to an 
established tradition, that the Weyl relations (6.28) are satisfied. 

The representations of canonical commutation relations are classified 
in two classes. The irreducible representations are those which leave no 
nontrivial closed linear subspace invariant, i.e., for which there is no 
closed linear subspace M ^ {0}, M ^ Jf, such that 

exp(iP k u k )f, exp (iQ k v k )fe M, k = 1 

for all fe M. Any representation which is not irreducible is called 
reducible [an example is provided by (8.76) in Section 8.10]. 

The following theorem was proved rigorously for the first time by von 
Neumann [1931]. 
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Theorem 6A (von Neumann's theorem). All irreducible repre¬ 
sentations of the canonical commutation relations (6.28) on a separable 
Hilbert space are unitarily equivalent. Any reducible representation 
Q k ,P k ,k = 1,..., n, on a separable Hilbert space is the direct sum of 
a finite or infinite sequence of irreducible representations, i.e., is the 
direct sum of a sequence M x , M 2 of mutually orthogonal closed linear 
subspaces and 

exp (iP k u) = £ exp(iP<£>«) E„ a , exp (iQ k v) = £ exp(*0j>“>©) E Ma , 

a a 

where Qi a) ,..., Qn\ Pi a) ,..., Pn ] are irreducible representations in M a . 

It will be shown at the end of this section that the Schroedinger 
representation, in which Q k and P k are given by the operators (4.19) and 
(4.20) of Chapter III, is an irreducible representation in L 2 (IRH). This 
establishes the existence of representations of (6.28) for n — 1, 2,... . 

In quantum field theory, or in any quantum mechanical theory of 
systems in which the number of particles can vary and become arbitrarily 
large, one is concerned with representations of the canonical commutation 
relations for an infinite number of degrees of freedom, i.e., for an infinite 
sequence of operators Q x , P x , Q 2 , P 2 >••• • ^ is worth mentioning 
—though this case lies beyond the scope of the present book—that von 
Neumann’s theorem cannot be generalized to this case. In fact, there is 
an infinite number of irreducible unitarily nonequivalent representations 
of an infinite number of canonical commutation relations [Garding and 
Wightman, 1954a, b]. 

*6.5 Appendix: Proof of von Neumann’s Theorem 

We shall prove in detail von Neumann’s theorem only for the case 
n — 1, i.e., when we are dealing with two canonically conjugate operators 
Q and P which satisfy the Weyl relation 

(6.29) giPu giQv _ gifiuv giQv giPu ll V E [R* 

This case contains all the essential features of the more general case. The 
lengthy proof of this theorem, which occupies the rest of this section, 
can be ignored at the first reading. 

Let us introduce the two-parameter family 

(6.30) S(u, v) = e iP ” e 

of unitary operators. It is easy to verify that 

(6.31) S(u x , S(u 2 , v 2 ) = exp[(*72) h(u 1 v 2 — ^ 2 )] S(u t + u 2 , v t + v 2 ) 
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by using (6.29). Taking in the above relation u = u x = — u 2 and 
v = v x = —v 2 , and noting that S( 0, 0) = 1, we obtain the identity 

(6.32) S(—u, —v) = S*(u, v) = S~\u, v), u,ve U 1 . 

If p(u , v) is a function which is Lebesgue integrable on R 2 , the integral 

(/1 g\ = f ®)</ I S(u 9 v)g> du dv 
J os 2 

is defined for any/, g e since </ | S(u , v)g ) is continuous and bounded 
in u, v e U 1 (see Chapter II, Theorem 3.9). It is easy to check that 
(/1 S)p * s a bounded bilinear form, and consequently it follows from 
Riesz’ theorem (see Chapter III, Exercise 2.5) that there is a unique 
bounded linear operator A p satisfying </1 A p g} — (/ | g) p . The function 
p(u , v) is called the kernel of A p . It is straightforward to verify by direct 
calculation, employing the relation 

(6.33) </1 A p g > = f P (u, v)(f | S(u, v)g) du dv , 

J or 

that the following statements are true: 

(6.34) A p * = A p , p(u, v) = p*(-w, -v), A Pi+P2 = A Pl + A Pz , 

where p , p ± , and p 2 are any functions which are Lebesgue integrable on 
U 2 . Moreover, we easily deduce that 

A PlP% A 0 9 

(6.35) 

p(u, v) = exp[(**/2) h(uv 2 — vu 2 )\ p ± (u — u 2 ,v — v 2 ) p 2 (u 2 , v 2 ) du 2 dv 2 , 

J K 2 

by the following procedure: 

</ I A pAp 28 > 


= <A p *f\A P2 g> 

= f . p 2 («2 , v 2 XA*f I S(u 2 , v 2 )g) du 2 dv 2 
J ur 

= (" , fts(«2 . ®»)</1 A oyS(u 2 , v 2 )g) du 2 dv 2 
J Or 

= f du 2 dv 2 p 2 (u 2 , v g ) I du! dv x | 5(m x , vj S(u 2 , v 2 )g) 

J R* J R* , 

(equation continues) 



344 


IV. The Axiomatic Structure of Quantum Mechanics 


= f „ Pi(«i, ®i) Ps(« 2 , « 2 ) exp[(t/2) - tJjttjj)] 

•'[F & 4 

X </1 Sfa + u 2 ,v 1 + z; 2 )£> d u i dv i ^ 2 dv 2 
= I /°i( M —u 2 ,v — V 2 ) p 2 (u 2 , v 2 ) exp[(**/2) #( mz; 2 — «w a )] 

J D« 4 

X </1 S(u, v)g) du dv du 2 dv 2 

= | du dv </1 S(u, v)g) I du 2 dv 2 p ± (u — u 2 ,v — v 2 ) 

J D3 2 ^ M 2 

X / 0 2 (m 2 , v 2 ) exp[(**/2) #( mz; 2 — aw 2 )] 


in which Fubini’s theorem has been used twice. 

We need now the following lemma. 

Lemma 63* ^4 P = 0 if and only if its kernel p(u^v) vanishes almost 

everywhere. 

Proof. A p = 0 implies that S(—u', —v’) A p S(u' f v') = 0 for any 
u\ v' e U 1 . We obtain, using (6.31) in the process, 


</| S(-u',-v')A p S(u', v')g} 

= I p(u, v)(f | S(—u, —v') S(u, v) S(u', v')g > du dv 

= p(u, v) exp [(ijh)(uv f — vu')](f \ S(u, v)g)> du dv. 

J R 2 

Letting u'jlTrlfi and vflTrlfi assume integer values, we infer that 
(6.36) f F(u, v) p(u, v)(f | S(u, v)g > dudv = 0 

J D5 2 

for any trigonometric polynomial F(u , v) of period l. Since p(u , v) is 
integrable, and </| S(u , v)g ) is integrable and bounded, we can take 
limits and generalize (6.36) to the case when F(u, v) can be any continuous 
function of period / > 0 (see Exercise 6.6). Thus, we deduce (see 
Exercise 6.8) that for any fixed fygeJ^y </1 p(u, v) S(u, v)g s ) = 0, 
except on a set R(f , g) of measure zero. If {^} is a countable orthonormal 
basis, we conclude that < e m | p(u , v) S(u , v) ef) — 0 except on the set 
R = (Jm,n R( e m > e n)> which is of measure zero. Hence, p(u, v) S(u y v) = 0 
outside R f i.e., p(u f v) = 0 almost everywhere. Q.E.D. 

It follows from Lemma 6.3 that the operator A = A Pq , where 
p 0 (Uy v) = exp[— (#/4)(w 2 + v 2 )], is different from zero. Moreover, 
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it is easily verified by using (6.31) and (6.33) that AS(u, v)A has the 
kernel 

/ eX P j! [K U 1 V 2 — V l u i) — K«1 — U zY — — v 2f 

+ i(uv 2 — vu 2 ) — i(u — u 2 ) 2 — £(© — w 2 ) 2 J | du 2 dv 2 
= exp [— ^ (n 2 + v 2 + ttj 2 + © t 2 )] J exp ^ (u 2 + © 3 2 )] du s dv 3 

= ¥ exp [~ \ ^“ 2 + ^ 2 )] ex P [ _ \ (“i 2 + ®i 2 )]- 

This implies that 

(6.37) AS(u, v)A = ~ exp [- | (n 2 + v 2 )]a. 

In particular, for u = v = 0 we obtain A 2 = {In/ft)A. In addition, we 
can derive from (6.33) that A = A*. Thus, ( fijl^A is a projector. 

Denote by M the closed linear subspace of onto which ( HI27t)A 
projects. For any /, g e M we obtain by using (6.31) and (6.37) 

(6.38) 

<S(u i, v ± )f | S(u 2 , v 2 )g} 
h 2 

= 4^2 < S ( U 1 > V l) A f I S ( U 2 > V *) A g> 
h 2 

= 4^2 </ I i4S '(—“l > —®l) ^( M 2 , ® 2 ) 

= 4^2 ex P [-y (®i«a — «i®s)J</1 AS(u 2 -u 1 ,v 2 — ©J 4?> 

= ^ exp [- ^ («2 — «i ) 2 — | (®2 — ®i ) 2 - y («1®2 - ®i«a )]</1 4 ?> 

= exp [- ^ («2 - Ml) 2 — | (©2 — ®i) 2 - y (« 1®2 — « 1 «s)]</1 £>• 

Let {e x , e 2 be an orthonormal basis in M, and denote by M a the 
closed linear subspace spanned by all the vectors S(u , v) e a , u , ^ e R 1 . 
We easily deduce from (6.38) that the subspaces M a are mutually 
orthogonal. 
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It follows from (6.31) that, for all u, v e R 1 , S(u , v)fe M a if fe M a . 
Denote by ^ the orthogonal complement of @ a M a . By virtue of (6.32), 
for any fe = @ a M a and g e , we have 

</| S(u , v)g> = < S*(u, v)f\g > = <S(—u 9 -v)f\g> = 0 

because S(—u,—v)fe^f > o f° r all u * v e On the other hand, it 
follows from the construction of M a that @ a M a D M, where M had been 
defined to be the subspace onto which {ftl2?r)A projects. Consequently, 
Af = 0 for all fe . But, this implies that ^ = {0}, i.e., J4? = @ a M a . 
As a matter of fact,^we have just proven that S(u , v) leaves invariant, 
and therefore, if ^ ^ {0}, we could apply Lemma 6.3 to the Hilbert 
space and to the restriction of S{u , v) to , and deduce that the 
restriction of A to ^ is not zero. 

It is clear from the construction of M a that S(u , v), and therefore 
e iPu an( j e iQv f u, v e R 1 , do not leave any nontrivial subspace of M a 
invariant. Hence, e iPu E M and e iQv E M are irreducible representations of the 
canonical commutation relations in M a . Thus, the first part of von 
Neumann’s theorem is proved. 

Consider now two irreducible representations P, Q and P', Q ' in . 
In view of the above construction, M and M' are spanned by the single 
vectors e a and ef, respectively, and the sets of vectors f u v = S(u , v) e a 
and f' UtV = S\u , v) ef are dense in Introduce the operator U , which 
is defined on f u>v by Uf u v = f' UtV and then extended to the entire Hilbert 
space Jf. It will be shown that U is unitary. 

By virtue of (6.38) 

</«„«, I /«,.«,) = exp [- ^ (u 2 - u x f - ^ (w 2 - Vyf - y {UjV 2 - «!M 2 )] 

= <u ,v x l/tt 2 ,® 2 X 

so that || Uf u>v || = \\f u>v ||. Thus, U can be extended to the entire Hilbert 
space by the extension principle (see Chapter III, §2), and the result is an 
isometric operator. Moreover, since the set {fu, v : u,ve U 1 } is dense in 
the range of U is and U is unitary. 

It follows from (6.31) that 

S(u, v)f u 'y = exp[(*/2) h(uv — vu')]f u ' +UtV ' +v , 

S\u, v)f' u ' tV ' = exp [i(ftl2)(uv' - vu')]fi' +u y +v . 

Consequently, we obtain the relation 

U-'S'iu, v) Uf uW = U-'S'(u, v)fl-j 

= exp [i(ftl2)(uv' — vu’)}f u '+ UtV '+ v = S(u , v)f u 'y , 
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which can be extended straightforwardly to Jf, so that 

S(u, v) = U-'S'iu, v)U. 

Since S(u , 0) = e iPu , S( 0, v) = e iQv , etc., we arrive at the result that 
e ipu — u-i e ip'ujj an j e iQv _ U-i e i Q 'vjj. This concludes the proof of 
von Neumann’s theorem. 

The proof of the theorem for the more general case of 2 n operators 
Pi y—yPn proceeds along identical lines, S(u, v) being 
replaced now by 

S(«l v, “n , ®n) 

= exp[—(t/2) *(«!©! + •“ + M n ^n)] eXpD'CF>l + — + ^n)] 
x exp[f(6i©i 4-h QtPn)]- 

In the case where Q and P constitute the Schroedinger representa¬ 
tion in L^R 1 ), we have (see Exercise 6.9) 

( 6 . 39 ) (e zPu ijj)(x) = ifj(x + hu), ( e tQv ifj)(x ) = e txv ifj(x). 

Consequently, we easily compute that 


(S(u 9 v)ip)(x) = exp [iv(x + \hu)] i/j(x + hu), 

(6.40) 

(Aifj)(x) = f exp[—(#/4 )(m 2 + v 2 )] exp[w(x + \hu)] ifj(x + hu) du dv 

J n » 2 



| (x + $*«)*] <K X 


hu) du 


£L exp [ _ i ( * ,+<,) ]* w ‘ 


Hence, the equation Aifj — (.Info)ifj has only one linearly independent 
solution, namely ift 0 (x) = e~ (1 > i.e., M is one dimensional. This 

establishes that the Schroedinger representation is irreducible. 


Exercises 

6*1* Verify that the commutator [A, B\ = AB — BA of linear 
operators defined everywhere on has the following properties: 


[A,B] = ~[B,A], 

[ aA, B] = \A, aB ] = a[A, B ], 
[A, BC ] = B[A, C ] + [A, B]C . 
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6*2* Derive (6.10) from (6.9) by induction, using the results of 
Exercise 6.1. 

6*3* Show that s-lim^ <o e iAt = e iAt °. 

6*4* Show that if U is a unitary operator, u-lim^ U l = [/*». 

6*5* Prove that E A defined in (6.21) is a spectral function. 

6*6* Let/(A), A e R 1 , be a continuous function of compact support. 
Prove that there is a sequence of trigonometric polynomials p n ( A) = 
Hk a nk exp[(2 Tr/ti) ik A] which converges pointwise to /(A). Explain also 
why the same statement is true when/(A) is the characteristic function of 
a finite interval. 

6*7* Verify that the relations (6.34) are true. 

6*8* Show that if p(x) is integrable on U n , and J^ n F(#) p(x) dx = 0 
for any continuous function F(x) of compact support, then p(x) — 0 
almost everywhere. 

6*9* Derive relations (6.39) and (6.40) for (Q$)(x) = xi[f(x), 
P = U^QUp , where U F is the Fourier-Plancherel transform in L^U 1 ) 
(see Chapter III, Eq. (4.20) and Theorem 4.6). 

7* The General Formalism of Wave Mechanics 
7.1. A Derivation of One-Particle Wave Mechanics 

In the preceding sections we outlined a very general framework for 
quantum mechanics, which is contained in Axioms O, S (or H), P, and C. 
In practice, more specific formulations of this formalism are usually 
required. Such formulations are achieved by making other assump¬ 
tions in addition to Axioms O, S, P, and C. These assumptions reflect the 
peculiarities of the particular class of systems under consideration. 

The assumptions most frequently made are those leading to the 
formalism of wave mechanics. It is believed that these assumptions apply 
in the nonrelativistic domain to all microscopic systems which can be 
considered to consist of a finite number n of stable particles (i.e., of 
particles whose nature does not change during the duration of the 
experiment). 

The fundamental system in nonrelativistic quantum mechanics is the 
microscopic quantum-mechanical particle (which we have in mind 
whenever we talk about a “particle”). The following requirement reflects 
the basic nature of the concept of a particle. 
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Axiom WL The set { Q {x) , Q {y) , Q (z) > S n } of the position observables 
Q (x) , Q (v) , Q {z) , and the spin projection *S n in a given direction n, constitute 
a complete set of observables of any one-particle system. 

The above theoretical assumption reflects an interpretation of certain 
experimental facts which determine the empirical nature of a particle. 
In other words, a particle can be defined from the experimental point of 
view as an empirical object to which measurements of position, 
momentum, and spin can be related. 

The observables Q {x) , Q {v \ and Q (z) correspond, naturally, to the 
measurement of the x, y, and z coordinates, respectively, of the position 
of the particle in some inertial Cartesian frame. It has become customary 
to choose the z axis of this frame to point in the direction n in which the 
spin projection is measured (i.e., the direction H in Fig. 3 in the Intro¬ 
duction). When this is the case, we denote the observable S n by S {z) . 

Axiom W1 implies the existence of a spectral representation space 
L 2 (R 4 , ft ) of the complete set {Q kx \ Q (v \ Q {z) , *5"}, whose elements are 
represented by ^-square-integrable functions 

(7.1) 0(*, y, z, s), f | 0(*, y, z, s)| 2 dfi(x 9 y, z , s) < +oo. 

It does not tell us anything, however, about the nature of the measure /jl . 
In order to decide which measures p we have to choose, we have to 
invoke two additional axioms, which will be given next. 

The following axiom reflects the experimental fact that the spin 
projection of a particle of spin o in some direction n can assume only the 
values —a, —(7+ 1,..., +(7 (see the discussion at the end of the Intro¬ 
duction). 

Axiom W2* The spin projection observable S n of a one-particle 
system of spin o has the pure point spectrum 

(7.2) S a = {—a, — <7 + 1, — a + 2,..., ct}. 

In the spectral representation spaceL 2 (R 4 , /jl) of { Q {x) , Q {v \ Q (z \ *5"}, the 
operator S n has the spectral representation 

(S n if*)(x, y, z, s) = si/f(x, y, z, s) 

so that 


(E S \B) 0 )(*, y, z 9 s) = X b(s) <A(*, y, 4 B e & 1 . 
Take a function of the form 

0(*. V> *> s ) = XB&, y, z) X b 2 (s), 
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where B x and B 2 are bounded Borel sets in M* and R 1 , respectively. Such 
a function obviously belongs to L 2 ((R 4 , p). If B C B 2 , we have 

II E sa (B) <l> II 2 = f Xb^x, y, z) Xb(s) M x , y, z, s) 

J R 4 

= /x(B x x B). 

Thus, if B n S„ = 0, we get 

M (B 1 x J B)=||£ s ”(B)^|p = 0. 

Moreover, ^"({s}) — 1 for s e S CT . Hence, we can write 
(7.3) ^XB)= I X {s}), 

seBnS a 

where the sum is always finite, since by the assumption in (7.2) S a is a 
finite set. 

Relation (7.3) has been established by the above argument for the case 
of bounded B x e 88* and B e 38 1 , but it can be extended also to unbounded 
Borel sets by using the continuity from below of measures to derive for 
arbitrary B x and B 

fitfr X B) — lim ^,(B[ n) X B m ), 

n~*oo 

B= B 1 n {(#, y, #): — n < x < n, —n < y ^ n, —n < z < «}, 
B {n) = Bn [-n, +«]. 

In view of the structure (7.3) of the measure fi, we find that the inner 
product <• | •) inL 2 (IR 4 , p) is of the form 

<0i 10 2 > = f 0 i*(x, y, *, *) 0 2 (*, y, *, *) M x > y> z , s) 

J orxs a 

+CT - 

= 11, ^i*{x,y,z,s)4> i {x,y,z,s)d t x{x,y,z,s), 

s=—a J DTx{s} 

due to the fact that 

f . >Pi*( x , y, z, s ) y, z, i) d/x(x, y, z, s) = 0. 

J R 1 -(R 3 xS a ) 

If we introduce the measures 

Hs(B) = X {*}), * = —+ l.-> <*, 
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on <^ 3 , we can write 


+o - 

(7.4) <«/-! | 0 2 > = E , y> *> s ) <A 2 (*, y, «, s) dfj. s {x, y, z). 

s=-a J K 3 


However, without any further assumptions we cannot determine in more 
detail the measures ^, S (P), J* 3 for s = — ct,..., +ct. 

The extra postulate which we need for determining fju s follows. 

Axiom W3* The self-adjoint operators P {x) , P (2/) , and P {z) represen¬ 
ting the momentum component observables of a one-particle system 
commute with the spin operator S n . In addition, the operators Q (x) , Q iv \ 
Q (z) , P ix \ P iy) , P iz) satisfy canonical commutation relations in which Q {x) 
and P {x) , Q iv) and P (2/) , Q {z) and P (0) are canonically conjugate. 

In order to be able to take advantage of Axiom W3, we note that (7.4) 
implies that the mapping 


y, z, s) -> if* s (x , y, z\ s = —a, —a + 1,..., a, 

of L 2 (R 4 , ft) onto @g„ CT L 2 ((R 3 , /jl s ) establishes a unitary equivalence 
between these two Hilbert spaces. Since we have 


(Q (x) if*)(x , y, z, s) = xif*(x, y, z, s) = xifj s (x , y, z\ 

and similar relations hold for Q (v) and Q (z \ it follows from von Neumann’s 
theorem (Theorem 6.4) that fju s have to be Lebesgue measures on J 13 , and 
that for differentiable functions ip(x f y f z f s) y ip E , 


(P (x) ip)(x, y , z, s) = —ih 


dip(x,y, z, s) 
dx 


with corresponding relations holding for P (v) and P {z) . 


7.2. Wave Mechanics of //-Particle Systems 

On the basis of the above discussion we have concluded that the 
spectral representation space of the set { Q {x) , Q (v} , Q {z \ £ n } constituting, 
according to Axiom Wl, a complete set of observables of a single particle 
system of spin ct, is the space L 2 (R 4 , /x), where /jl is the measure satisfying 

KBl x {s}) = ^ 3) (Pi), ^ = — (J, —or + 1,..., or, 

/x(P x x B 2 } = 0, B 2 n {—cj,..., + <j} = 0, 

and /x\ 8> (jB 1 ) is the Lebesgue measure of B x , B x e& z . We have also seen 

that a state </r can be represented at any instant by an element of the 
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direct sum @ S L 2 (IR 3 ) of (2o + 1)L 2 (R 3 ) spaces. Wave mechanics is 
ordinarily formulated in this @ S L 2 (IR 3 ) space, in which a state vector is a 
wave function 


z , s), x,y,ze U\ s = —a, —a + 1,..., + o, 

which is Lebesgue square integrable in the x, y , z variables for each fixed 
value of s from the set S CT = {—ct,..., +<r}. 

By means of Axiom P in §4 we can generalize the above results to the 
n-particle case. 

Suppose that we are dealing with a system S of n different particles of 
spins Oj ,..., (J n , respectively. Then according to Axiom P, we can 
associate with S the Hilbert space 

(7.5) *i = [©L 2 (R 8 )] 0 - 0 [®L 2 (R 8 )], 

S 1 s n 

where the Mi factor, k = 1,..., n, is a direct sum of (2 o k +1) L 2 (IR 3 ) 
spaces. The linear mapping U 1 , which assigns to a vector of the form 

(7.6) (0^ 0-0 0^,) 0-0©it0-0 0^), 0",eyflP) 
from the vector 

(7.7) © - © (<0-00 s < : ) ) 

s l =_or l s n- a n 

from the Hilbert space 

(7.8) ^ 2= @...@[ LW] ®n^ 

Si s n 

determines a unitary transformation of onto ^ (see Exercise 7.1). 
Moreover, by Theorem 6.9 in Chapter II, there is a unitary transforma¬ 
tion of [L 2 (R 3 )]® n onto L 2 (R 3n ), which maps p{x x , y\ , , $i) ® **• @ 

>P( x n >y n >z n > s n ) into ijj(x 1 , y-i., , *i) • •' >K x n ,y n > z n> s n)- By carrying 

out this transformation of each one of the terms [L 2 (R 3 )]® n of the direct 
sum (7.8) intoL 2 (R 3n ) we arrive at a unitary transformation (see Exercise 
7.3) of the Hilbert space 34?% into the Hilbert space 

(7.9) 34? =® •••©L 2 ((R 3 ”). 

S 1 s n 

The elements of 34? (called the configuration representation space) are, 
for fixed s ± ,..., s n , square-integrable wave functions 

(7.10) ifj(x i, y ±, z ^, Sj,..., x n ,y n , z n , s n ), s% = o^.,..., -f~ cr^.. 
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Thus, U = U 1 U 2 is a unitary transformation of onto mapping 
a vector of the form (7.6) into 

(7.11) ^ (1) (*l , ?! , *1 , *i) • • • , S„). 

The ^-coordinate position operator in for the kth. particle is 

Qlfi'" ® 0 k( x k ? J'fc J z k J s k) @ ***) — *** ® ( x k^k{ x k >2?k > z k > s n )) ® ***? 

and corresponding expressions hold for Q^ and Q^\ It is very easy to 
check (first for vectors in represented by functions of the form (7.7), 
and then generalize the result) that the ^-coordinate position operator 
in of the &th particle 

Ql x) = UQ^U~\ k = \ 

is given by (cf. Definition 5.2 of spectral representation spaces) 
(£?lfV)(-> ** », ** , ^ »•••) = x k <f>(..., x k ,y k ,z k , s k 


Naturally, similar expressions hold for Q^ and Q Jf\ 

An analogous consideration easily yields that the momentum operator 
Pcanonically conjugate to Q Jf* satisfies the relation 


(PjT0)(-, x k ,y k ,z k , s k ,...) = — i 


x k ,y k ,z k , s k ,...) 
dx k 


when P^ c) is acting on differentiable functions from the domain of 
definition of PjfK 

The above considerations cover the case of w-particle systems 
containing no identical particles. In the more general case when identical 
particles are present, the Hilbert space of the system will be a closed 
linear subspace of the Hilbert space in (7.9), consisting of all appropriately 
symmetrized (see Definition 5.1) functions in (7.9);i.e.,the wave functions 
of the system will be symmetric with respect to position-spin variables 
(r, s) corresponding to particles of integral spin (commonly called 
bosons ), and antisymmetric with respect to variables corresponding to 
particles of half-integer spin (commonly called fermions) * 

The entire discussion of this section can be duplicated with the 
complete set (see Exercise 7.5) { P {x) , P (2/) , P (e) , S n } instead of {Q (x \ Q (v \ 
Q {z) , S n }. The procedure leads then to the same Hilbert space (7.9), 
except that now the state vectors of this space are wave functions 


Y\Pl 9 Pi j Pi > Pn > Pn > s n)> 


Sir. - -O’fc 


# See also Exercise 7.4. 



354 


IV* The Axiomatic Structure of Quantum Mechanics 


in momentum variables (so that as the spectral representation space 
of {Pfc , S k n : k = 1,..., n) is called the momentum representation space) 
and the momentum operators P k x) have the representation 

s k ,...) = pi z \ s k 


with corresponding relations holding for P^ ] and Pjf ] . The almost 
complete symmetry of the problem under the interchange of position and 
momentum variables (note that [ Q (x) , P (aj) ] = — [P {x) , Q {x) ], etc.) makes 
it obvious that Qjjf* is represented now by 


Ax) Av) Az) o \ _ Pk*\ p^\pk\ s k >•••) 




when acting on differentiable functions from its domain of definition; 
similar relations hold for Q^ and 


7.3. The Schroedinger Operator 

In any picture of quantum mechanics (see §3) we postulate for every 
quantum mechanical system the existence of a Hamiltonian H , which 
plays the dual role of determining the dynamics of the system and 
representing the total energy observable. However, none of the general 
axioms provides any clue about the more specific form of H. 

In nonrelativistic quantum mechanics one normally approaches the 
problem of finding H of a particular system of particles by looking at that 
system from the classical-mechanics point of view. Thus, one chooses a 
classical model with a potential V from which all the forces acting between 
the particles of the system can be derived. The transition to wave 
mechanics is made by writing the Schroedinger operator for that system. 
The hidden assumption contained in this procedure is the following. 

Axiom W4* The Hamiltonian H s of a system of n spinless particles 
is a self-adjoint operator which maps elements p e , given in the 
configuration representation space by twice everywhere continuously 
differentiable functions, into 

[-££( 

+ V ( x 1 , y 1 , z 1 x n , y n , *„)] i/^i, y{~, *1 x n , y n , z„j 
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whenever the above expression represents an element of H s is the 
only self-adjoint operator which maps all such elements iff e into 
elements representable by (7.12). 

An operator H s satisfying the above conditions is called a Schroedinger 
operator. 

For particles with spin, H s contains in general terms which depend 
on the spin-component operators. Since spin operators are obviously 
bounded, the results derived later in this section are easily generalized 
to that case. Sometimes even “velocity-dependent” potentials, containing 
first derivatives in the position variables, i.e., depending on the 
momentum operators, are used. Their treatment might require special 
attention, and the reader interested in such cases is referred to the 
literature quoted at the end of this chapter. 

We note that Axiom W4 imposes constraints on the functions which 
can play the role of potentials. A potential which has too many 
singularities, or some singularities that are too strong, might not satisfy 
the second requirement of Axiom W4. 

7.4. Closures of Linear Operators 

The main reason for not treating the Schroedinger operator exclusively 
as a differential operator is that differential operators given by means of 
differential operator forms are ill defined, since their domains of defini¬ 
tion are ambiguous. However, one hopes that when defining a linear 
operator on a dense domain by means of the Schroedinger operator form, 
one will be able to extend that operator uniquely and in a natural way 
to a self-adjoint operator. This goal is usually achieved by taking the 
closure of that operator. 

Definition 7*1 ♦ The linear operator A is the closure of the linear 
operator A if it is the minimal closed extension of A , i.e., every closed 
operator C satisfying C3 A also satisfies CD A. 

The existence problem of the closure of an operator is treated in 
Theorems 7.1 and 7.2. 

^Theorem 7*1 ♦ If the linear operator A in the Hilbert space 
has an extension which is a closed operator, then it has a minimal 
extension A, i.e., its closure exists. 

Proof. Let A 0 be some given closed extension of A. Define the 
operator A in the following manner: the vector / is in the domain of 
definition of A if and only if / e & Aq and for any e > 0 there is a vector 
g G A such that 
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When / e Of A , we define 

Af = AJ. 

The so-defined operator A is certainly linear, since if f x , / 2 e and 
a x ^ 0, fl 2 ^ 0, there are, for any given e > 0, vectors g x , g 2 e such 
that 


11 ^o/*- - II < 2tVr ’ *' =1 ’ 2 - 

Consequently 

II a ifi + ^ 2/2 - (“igi + « 2 ^ 2 )ll < II A 0 ( a ifi + ^ 2 / 2 ) - A(a±g x + 0 ^ 2)11 < e, 
which implies that a x f x + a 2 f 2 e@> A and 

a ifi + ^ 2 / 2 ) = ^oK/i + ^ 2 / 2 )- 

Finally, the operator A is closed. To see that, note that if A , / 2 e Of A 
converges to/e and Af x , Af 2 ,... converges to/, then for any integer 
n there is a g n e such that 

ll/n - gn II < l/«, II A 0 f n — Ag n || < 1 /«. 

Consequently, if \\n < e/2 and 

||/-/ n ||<e/2, 11/ Af n || < e/2, 

we get by combining the above two inequalities and applying the triangle 
inequality the following: 

II/ gn II < € > 11/ Ag n || < €. 

This result, in conjugation with the fact that A 0 is closed (and therefore 
/ = A 0 f) yields that A is closed too. It is easy to infer from the way in 
which A was constructed that any closed extension of A contains A . 
Consequently, A is the minimal closed extension of A . Q.E.D. 

Theorem 7.1 guarantees the existence of the closure I of a linear 
operator A only in those cases when it is known that A has a closed 
extension. However, symmetric operators A with = Jf 7 have A* 
as an extension, and since A* is closed (see Chapter III, Theorem 2.8), 
A exists. 

^Theorem 7*2* If A is a symmetric operator densely defined in 
Jf, its closure exists and is identical to A**. 
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Proof ’. Since A** is the adjoint of an operator, it is closed. Moreover, 
A** D A, so that A has a closed extension. Consequently, A exists and 
ACA**. 

The relation A C A ** implies A* 2 A*** = A * (see Exercise 7.6), 
and therefore A* is densely defined and has an adjoint A **. We shall 
prove that A ** == A. 

Both A and A ** are closed linear operators and consequently their 
respective Cayley transforms V and V x are closed also (see Chapter III, 
Theorem 4.7). Moreover, AC JT** implies that VCV 1 . 

We recall that the domain of definition of V is closed and identical 
to ^A+i . Thus, © S) v consists of all vectors satisfying 

<(A+i)g\f>=0 

for all g e . Since the above relation is identical to 

<Ag I/> = ge®A, 

it follows that fe © &J V if and only if A*f = if. 

Since A* = A *** == (JT**)* (see Exercise 7.6), it follows that 
© Q) v = , i.e., S) v = 2 V . Consequently, the relation 

VCV 1 holds only if V_= Vj , i.e., A = A**. 

Using the result A == A**, we obtain from A C A C A** that 
A* D A * D A*** A*. By taking adjoints in the preceding relations 

we get A** C A C A**. Thus we have arrived at the desired result 
A ** s= A. Q.E.D. 

7.5. The Schroedinger Kinetic Energy Operator 

By choosing in (7.12) V(x 1 , y x , z x ,..., x n , y n , z n ) =0 we obtain the 
Schroedinger operator form for the kinetic energy operator 

(7.13) (T s if*)(xi ,y 1 ,z 1 x n ,y n , z n ) 

- _ *!. v J_ i 

_ 2 fc=i m k ' &*** ^fc 2 / 

x 0(^1 , yi , #1 x n,y n , *n)- 

In order to be able to treat T s as a differential operator in L 2 (R Sn ), we 
require that each \fs eL 2 ((R 3n ) to which T s is applied be representable by a 
twice differentiable function with square integrable second derivatives. 
An operator T s defined in this fashion would be symmetric but not self- 
adjoint. It will be shown, however, that such an operator is essentially 
self-adjoint. 
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Definition 7*2* The symmetric operator A in is said to be 
essentially self-adjoint if its closure A is self-adjoint. 

According to Theorem 7.2, ^4** is identical to the closure A of A. 
Since (^}**)* ~ A* = A ** (see Exercise 7.6), we see that A is essentially 
self-adjoint if and only if A* = A **. 

^Theorem 1*3* The symmetric operator T 0 , defined on the family 
of all elements/eL 2 (IR 3n ) representable by functions 

(7.14) f(r 1 r„) = P(r x r„) exp [— ^ (r t 2 -f ••• + r„ 2 )] 

corresponding to all choices of polynomials P(t 1 ,..., r n ), and acting on 
/ E according to 

WX**•«) = - y ( E ,..., r„), 

/c=l k 

is essentially self-adjoint. Its closure T 0 is identical to the operator 

(7.15) = £ dr {p « x) * + p * v) *+ p ^- 

k =1 k 

Proof. The operator T s is a function 

(7.16) T s = K( P, P„); K( P x P„) = £ P fc «/2», , 

fc=l 

in the sense introduced in Theorem 2.5, of the commuting self-adjoint 
operators P^\ Pjf \ P { ^\ k = 1,..., n (see Chapter III, Theorem 4.11). 
Therefore, from Theorem 2.7, and in particular (2.26), we deduce that for 
/e^ Ts , T s acts on the Fourier-Plancherel transform / of / in the 
following way (see also Exercise 7.7): 

(7.17) 0 V) (Pi Pn) = ~i~ P* 2 /(Pipj- 

^ fc=l rn k 

Now, is invariant under the Fourier-Plancherel transform U F (see 
Exercise 7.7), i.e., t/ F ^ 0 = , so that if /e , then /is representable 

by a function of the form 


T(pi v, p„) exp[-(2^)" 1 (pi 2 + *•• + p n 2 )]- 

It follows that Tg/ is defined by (7.16) when fe , i.e., C ^ rg . 
In view of the fact that is dense in L 2 (R 3n ) (see Lemma 5.2), the set 
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@ Ts [consisting of all /eL 2 ((R 3n ) for which the function on the right- 
hand side of (7.17) is square integrable] is dense in L 2 ((R 3n ). Thus, rp 8 
is a real function of self-adjoint operators and a densely defined operator. 
Therefore, T s is self-adjoint by Theorem 2.6. 

It is easy to see that T 0 C T s . As a matter of fact, we have 

(^o/Xri r n ) 

= (~~r £ (2wo 3 *' 2 pn) 

X exp[(«/^)(p 1 • r x + ••• + p« • !•„)] dp 1 ■■■ dp n 

1 1 c n i 

= 2 (2tt£) 3m/2 “ J K 3„ £ ^7 Pfc2/(pl Pm) 

X expt^Xp! * r x + ••• + Pn • r n )] dfr — dp n 
= (T 0 f)(rir n ) 

since for/e C of the form (7.14) the order of differentiation and 
integration can be interchanged by Lemma 3.1 and the mean value 
theorem of differential calculus. 

We shall prove now that T s is the closure of T 0 . Let us show first that 
(1 + T 0 ) % is dense in L 2 (R 3n ). 

Assume that £eZ, 2 ((R 3n ) is orthogonal to (1 T 0 )@ 0 . Since 

U F @ 0 = SJq , this assumption implies that 

(7-18) f |(Pip n )[l + K ( Pl ,..., p n )] P(p 1 ,..., p n ) 

* , 05 3n 

x exp[—(2^)- 1 (pi 2 + ••• + p w 2 )] dp ± ••• dp n = 0 


for all choices of polynomials P(p x ,..., p n ). The function 

(7.19) 

= S (Pl Pn)[l + ^(Pl Pn)] eXp[-(4«)- 1 (p 1 2 4- — + Pn 2 )] 

is square integrable, and according to (7.18) it is orthogonal to all 
functions of the form 

( 7 . 20 ) P{p x ,..., p„) exp[-( 4 *)" 1 (Pi 2 H-h p„ 2 )]. 

By Lemma 5.2 the family of all functions (7.20) is dense in L 2 (R 3n ). 
Therefore, we must have fi(p 1 ,..., p n ) = 0 almost everywhere. Since 


[[ + K(p t ,.... p n )] exp[~(4^)" 1 (pi 2 -I-b pn 2 )] > 0, p t ,..., Pn € M 3 , 
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we must have g( p x p n ) = 0 almost everywhere, i.e., g = 0, and 

(1 + T 0 ) % is dense inL 2 (IR 3n ). 

Let us take any f e . Since (1 + T 0 ) is dense inL 2 (IR 3n ), there is 

for any e > 0 a vector f e e such that 

||(1 + T s )f-(l + T 0 )f'\\<€, 

i.e., in view of T 0 C T s , 

11(1 + T s )(f-f e )\\<e- 

It is easy to see from the relation 

11(1 + W-/J 2 = f 3 I[1 -^(Pl ?•••? Pn)](j^ /e)(Pl v? Pn)| 2 ^Pl ^Pn 

J K 3n 

that since K( p x ,..., P n) > 0, 

II/-/.II <11(1 + 

II T s f - TJ € II = II T s (f -/,)II < ||(1 + T s )(f -/J < 6, 

and consequently T s C T 0 . However, T s is self-adjoint, and therefore 
closed, and since T 0 C T s , we must have T s = T 0 . Q.E.D. 

7.6. The Schroedinger Potential Energy Operator 

The operator V representing the potential energy is given in the 
Schroedinger picture by means of a potential F(r x ,..., r n ) 

(Ps0)(*i v, r n ) = Vfa ,..., r n ) 0(r x ,..., r n j, 

where V(r ± ,..., r n ) is a real function. It is quite obvious that V s is a 
function of \ Qi \ in the sense of the definition of a function of 

commuting operators given in Theorem 2.5 (see also Exercise 2.7): 

(7.21) V s = V(Q[*\ 0<*>, Qi x) , Qi v \ 0«). 

We cannot expect, however, that V s = F s * for any choice of the real 
function V(r 1 ,..., r n ). The requirements subsequently imposed on 
V(r 1 ,..., r n ) will enable us to prove next that V s is self-adjoint and, 
later on, that T s + V s is also self-adjoint by virtue of the fact that V s 
is bounded relative to T s in the sense of the following definition. 

Definition 7.3. An operator K is said to be bounded relative to 
another operator A , or A-bounded, if Sf K D Sf A and if there are constants 
a, b ^ 0 such that || Kf\\ ^ a || Af\\ + b ||/|| for all fe 2 A . The in- 
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fimum of all values a for which there is some b such that the preceding 
condition holds true is called the relative bound of K with respect to 
A , or its A-bound. 

We note that if K is bounded in the sense of Definition 1.5 in Chapter 
III, then it is bounded relative to any operator A with domain Sf A C Sf K , 
and that its ^4-bound is zero. Indeed, in case of such a bounded operator 
K we can satisfy the conditions of Definition 7.3 with the choice a — 0 
and b = \\ K\\. However, an operator K can have a relative bound zero 
with respect to another operator A without actually being bounded. 
Such examples are provided in the next theorem which, in fact, states 
that V s has T s -bound zero, whereas the conditions on the potentials 
for which the theorem is valid are sufficiently liberal to allow for the 
possibility of V s being an unbounded operator. 


^Theorem 7*4* Assume that the potential V(r 1 ,..., r n ) can be 
written in the form 


y(Vl ?•••? ** n ) V 0 ( ri ,..., r n ) ^ Vij(Vi 


1 

i<j 


r,) + E Kofo), 

i =1 


where V { fr 1 ,..., r n ) is measurable and bounded on IR 3n , while V^r), 
i,j = 0, 1,..., n (i <j) are locally square integrable and bounded at 
infinity,* i.e., there are positive constants R , C such that 

(7.22) f | F,,(r)| 2 dr < +oo, | VJr)\ < C, r > R. 

J r^C R 

For such a potential, the operator 

(^s/)(rir n ) - F(r x ,..., ,..., r n ), 

defined on the family of all feL 2 (M? n ) for which the function 
V{r 1 ,..., ^ n )/(r i,..., r n ) is square integrable, is self-adjoint, and 
& Vs D & Ts . Moreover, the operator V s is such that for any constant 
a > 0 there is another constant b > 0 such that 

(7.23) II F s /|| < || 7s/1| + b ||/|| 

for all fe @ Ts . 


# It has become customary to denote the class of all functions /(r) having these two 
properties by L A ([R 3 ) -|- /^(IR 3 ), where L^R 3 ) and L°°(IR 3 ) denote the Banach spaces of 
functions that are measurable and, respectively, Lebesguc integrable or bounded on IR 3 . 
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Proof. It follows from the assumptions on V{r 1 r n ) that 
F(r x r, n )f{r 1 r n ) is square integrable on R 3n if fe& 0 . This 
conclusion can be easily reached by using Theorem 3.9 in Chapter II. 
This theorem can be immediately applied to V () {r 1 ,..., r n ), which is 
bounded | V 0 (r ,,..., r n )| < b, and therefore 

(7.24) | Voir, r n )/( ri ,..., r n )| < b, |/(r,,..., r n )|. 

Its application to F^(r* — r,) and V oi (r t ) is somewhat more roundabout 
(see Exercise 7.8). Since is dense in L 2 ([R 3n ) and, by virtue of (7.21), 
V s is a real function of the position observables, it follows from Theorem 
2.6 that V s is self-adjoint. 

A glance at (7.24) makes it obvious that V 0 satisfies (7.23) with a = 0 
and b = b, . The proof of (7.23) for V^ is more intricate, and it will be 
carried out next only for the representative case of V 12 . 

In the new variables 

r x = (l/VSXri - r 2 ), r 2 '= (l/VZXr! + r 2 ), r 3 '= r 3 r n '= r n , 

the function 


g(ri,..., r„') =/(!•!,..., r„) 

is of the form (7.14) whenever fe 3> 0 . This is due to the fact that 

r 'l + r 2 2 = r i 2 + r 2 2 ’ 

In addition, the Jacobian of the transformation r k —*■ r k , k = 1,..., n, 
is equal to one, 

dr x ••• dr n ' = dr 1 ••• dr n , 
so that writing Vi 2 (r x ) = V 12 (r x — r 2 ), 

II ^12/II 2 = f I l / 12 (r 1 - r 2 )f(r 1 r„)P dr,- dr n 

= f I V' X2 (r 1 ')g(r 1 ',..., r„')l 2 dr x ' ••• dr n '. 

J [R n 

To be able to estimate the above integral, we split the domain of integra¬ 
tion into two parts 



(7.25) 



7* The Formalism of Wave Mechanics 


363 


where R' = 2 and R is the constant appearing in (7.22). We shall 
study the two parts separately. 

According to Fubini’s theorem (Chapter II, Theorem 3.13) 

(7.26) f I v 'i 2 ,( r i) g( r i .r,')| 2 

J r^^R' 
r 2 e[R 3 


F(ri ')| 2 f 

ID 


I g(r i,- 


If g(Pi ,..., p n ') denotes the Fourier transform of^r^r n '), we have, 
by (4.12) and Theorem 4.6 in Chapter III, 


(7-27) f 1 ^ r ( r i%---> r n , )l 2 4r 2 ' ••• dr n ' 

J R 3(n-i) 

= (2t7/z)- 3 f dp 2 ’ ■■■ dp n ’ f exp(j'/«)(r 1 ' • Pi')|(Pi',..., p„') 4p/ 

J Dr (n-1) u 3 

< (2 ^ )_3 L.«-« ipa ' ■" ip «' LL p»')i ^'f- 


Moreover, for an arbitrary positive constant e, (1 + ^ 4 (pi 2 ) 2 ) -1 and 
I g(Pi>—> Pn )l(l + e 4 (Pi 2 ) 2 ) 1/2 are square integrable in p/ e (R 3 . Hence, 
we can apply the Schwarz-Cauchy inequality in L 2 ([R 3 ) to these two 
functions, and thus derive 

(7.28) 

[J I i(pi',-> p»')l ^Pi'] 

- 1/.. 11 + 

- ’’/ 2 f. i kpi' .p.')i‘ (i + -‘(p;')’) -'p, - 

= 77 y 2 f , l#(Pl'»-»P»')| arf Pl' + V2 7T 2 £ f | p; 2 ?(p i ',...,p„')| 2 #i' 

€ J or ^u 3 

^ “r - f JI(PiV->P»')l 2 4pi' 

€ J R 3 

+ 4 ^ *<P>'.P*')|’ ‘‘Pi's 



364 


IV* The Axiomatic Structure of Quantum Mechanics 


in the last step of the above derivation we have used the fact that, in 
terms of the momentum variables Pi, •» Pn > 


Pi 


= ~ki (Pi — Pa), Pa' = -7; (Pi + Pa), Ps' = Ps 


V2 


V2 


Pn 


so that r x • Px + r 2 • p 2 = *1 • p x ' + r 2 ' • p 2 ', and consequently 


Pi 2 = i(pi 2 - 2pi • p 2 + p 2 2 ) < Pi 2 + Pa 2 


< 2K + m 2 )(^- + -g- + - + 


P«M 

2m n ) 


Inserting the last inequality of (7.28) in (7.27), and noting that 
f , IJ(Pl',-,P»')| 2< *Pl' — d Pn 

J r 3n 

= f l^i',...,r B ')| 2 dr 1 '-dr n ' 

J K 3n 

= f |/(r 1 ,...,r„)| 2 <*r 1 - dv n = ||/|| 2 , 

J R 3n 


L- 111 11 T >"- 


we get 


h 3 


f I |'(r 1 ',..., r„')| 2 dr 2 ' ••• dr n ’ 

»'rn>3(n-l) 


]}3(n-l) 

<- X — - II /'ll 1 + (mi t < II Ttf&. 


4 V2i 


V2; 


Hence, combining (7.22), (7.25), (7.26), and (7.28) we obtain 

_L II /II* + ( m i + e II T a f\A f I Vi 

^ V2iT fi 3 11 oJ11 


Vi 


2 /ll 2 < [■ 






L 4 V27T 

’ €® • 

+ C 2 f 

dr. 

J n 

'>R' 

/ ^12 

__L_, 

\ 4 \/2n 

e 3 4 

\l Cl 2 

1 


'<R' 


rrn3(n-l) 


^ + c ,)ll / ,|, + i2 1 +g)!£u,|| W 


v 1/2 


v 1/2 


4 \^2ti 


\/2ti 


where c 12 is a constant defined in general as follows: 


= *“ 3 f I ^(r/)| 2 */, i<j. 
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Since e > 0 can be chosen arbitrarily small, we have arrived at the result 
that the inequality (7.23) is true for all fe@ 0 and any a priori given 
a > 0, if we choose e such that 


(7.29) 


>v ry r (ntj + m } f % i 1 ' 2 

vi- J 


Take now a vector h e , and let h x , h 2 ,... e be a sequence for 
which 


= s-lim 7y* n . 

w-»°o 


Since (7.23) has been established for vectors from , we can write 
II V s (h m - h n ) || < a || T s (h m - A n )|| + b\\h m -h n ||, 
which implies that V s h x , V s h 2 ,... is a Cauchy sequence. Hence 

h! = s-lim V s h n 


exists, and since V s is self-adjoint, and therefore closed, we have h e &> Vs 
and h! = V$h\ hence Sj Ts C ® Vs - Furthermore, 

II Vsh\\ ^ II V s h n II + II V s (h — A n ) || < a || T s h n || + b || h n || + || V s (h — h n ) ||, 

and in the limit n -> + oo we obtain (7.23) for an arbitrary h e @ Ts - 

Q.E.D. 


The very general type of potential with which Theorem 7.4 deals 
consists of three distinct parts: V 0 , which describes a global interaction 
in between all n particles; Vy , describing a two-body interaction in 
between particle i and particle j\ and V oi , describing the influence of 
some sources external to the system (such as the external fields discussed 
in Section 7.9) onto particle i. As examples of particular practical impor¬ 
tance we can mention the case of Coulomb and Yukawa two-body 
interactions corresponding to the choice 


(7.30) 


Vo( r i >•••> **n) — Fo$(r$) — 0, 

= A,, exp(-#c tf Xr< - vd) 


i < h 


with Ky = 0 and k y > 0, respectively, where A ^ are coupling constants 
(which in the Coulomb case equal the product of the respective 
charges of the ith and ;th particle—see Chapter II, Section 7.7). 
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7.7. The Self-Adjointness of the Schroedinger Operator 

The standard method of proving the self-adjointness of an operator 
of the form A + K, where A is already known to be self-adjoint, is 
based on a result first proven by Rellich [1939] (see Lemma 7.2 later in 
this section), in which the useful notion of operator core figures 
prominently. 

Definition 7A. Let A be a closed operator in Jf, Z) a subset of 
Sf A , and A d the restriction of A to Z), so that A D C A. The set D is 
called a core of A if A D = A. 

By Theorem 7.3 the set is a core for the self-adjoint operator T s . 
As has been illustrated so many time on the differential operator forms 
we have encountered in preceding sections, such forms define operators 
on L 2 spaces only when applied to suitably chosen sets D of functions. 
In order that such forms unambiguously lead to self-adjoint operators, 
it is essential that these sets D represent cores (i.e., that the action of the 
differential operator form on all elements of D represent an essentially 
self-adjoint operator). It should be realized, however, that one and the 
same operator can have many cores. In fact, if we reconsider the proof 
of Theorem 4.10 in Chapter III, we see that the argument used there 
leads to the following more general conclusion. 

Lemma 7*1 ♦ A densely defined symmetric operator is essentially 
self-adjoint if and only if both its deficiency indices are zero. 

Thus, as long as for some A = A* a set D C Of A is such that the 
restriction A D of A to D has a Cayley transform whose domain &t A 
and range & Aj) -i are dense in that set D is going to represent a 
core for A. For example, we might have used Lemma 5.2 to build dense 
sets D of functions (5.18) with a choice of /o^ r n ) that would be 
different from the exponential function in (7.14) and thus arrive at 
other* cores for T s , but then the computations of the Fourier trans¬ 
form of such functions needed to establish Theorem 7.3 would not have 
been as straightforward. 

Lemma 7*2* (Rellich 9 s theorem). If A = A* and K is symmetric 
and relatively bounded with respect to A with an ^4-bound smaller than 
1, then A + K is a self-adjoint operator with domain 3f A + K = Sfi A ; 
furthermore, A + K is essentially self-adjoint on any core of A . 

Proof. We shall prove first that ^4 + is self-adjoint by showing 
that its Cayley transform V is unitary (see Chapter III, Theorem 4.9), 
i.e., that & A+ x + i = & A +x_i = . 

* The set D — ^t,(D^ 3n ) is a very popular alternative choice (e.g., see Reed and Simon 
[1975], Sections X.1-X.4]). 



Since for any A > 0 the operator A ~ X A is self-adjoint, we infer from 
Theorems 4.7 and 4.9 of Chapter III that the ranges of the operators 
A i iX = A(A -1 ^4 zt i) coincide with Jf 7 and that (A i /A) -1 exist. 
Moreover, (A i /A) -1 g e Q) A C Q) K for all g e ^ and in accordance 
with the Definition 7.3 of relative boundedness, 

(7.30) || K(A ± i\)-'g || < a || A(A ± iAy'g || + b || (A ± iAy'g ||, 

where we can choose a, b ^ 0 with a < 1, since the A -bound of K 
is assumed smaller than 1. On the other hand, as in (4.14) of Chapter III, 

(7.31) \\{A ±*A)/|| 2 = M/|| 2 + A 2 1|/|| 2 , feS Ai 

so that by setting above / = (A i /A) -1 g we get 

II A(A ± i\)-'g II < \\(A ± iA)(A ± iX)-'g II = II ^ II, g e JT, 

a I \(A ± ixy'g II < II (A ± iX)(A ± ixy'g II = II ^ II, g G je. 


Consequently, chossing in (7.30) at fixed 0 ^ a < 1 and J >0 a 
sufficiently large value of A > 0, we obtain 

II K(A ± i\)-'g || < (a + bl A) |k II < Ik II, g e 

This implies (see Exercise 7.9) that the range of 1 -f K{A i zA) -1 
equals Jf 7 . Since we have seen that the ranges of A ± *A equal 
we conclude that the ranges of 

A+K±iX = [ 1 + K(A ± iXy^A ± iX) 

also equal By Theorem 4.9 in Chapter III this implies that 
A -1 M + K) m self-adjoint, and therefore so is A + K. 

If D is a core of A then for any/e^ there must be a sequence 
fi y y *2 »••• E D such that/ n —► / and Af n —► Af as n —► oo. But then 

II -f n )\| < a || A{f-f n )\| + b ||/ -f n || - 0 

so that the domain of the closure (A -j-_K) D of A + K restricted to D 
contains the domain of the closure A D = A. Since @ A + K = Of A and 
A h K is self-adjoint, we conclude that (A + K) D = A + K y i.e., 
D is a core of A + K. Q.E.D. 

Since by Theorem 7.3 the T^-bound of V s is zero, we can immediately 
apply the above lemma to T s -|- V 8 , and thus state the following result. 

Theorem 7*5* (Kata's theorem ). The linear operator Hq acting 
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on any function of the form (7.14) as follows, 

(7.32) (H S J){ 

= [-K* 2 ) i (Or 1 A* + V(ri ..... r„)]/(r,..... t n ), 

is essentially self-adjoint on i f the potential satisfies the conditions 
of Theorem 7.4, and its closure H@ q coincides with the Schroedinger 
operator H s = T s + V s , which is self-adjoint and has the same domain 
as T s in Theorem 7.3. 

Theorem 7.5 was first established in essentially the above form by 
T. Kato [1951]. As discussed earlier, there is a great latitude in the choice 
of core, and a rather popular alternative choice is 

Although Theorem 7.5 is applicable to all the specific cases of inter¬ 
actions treated in the next chapter, it does not cover all cases of physical 
interest. However, extensive studies of essential self-adjointness of other 
types of Hamiltonians appearing 4n wave mechanics have been carried 
out by a host of researchers,* 6o that all cases of practical interest in 
nonrelativistic quantum mechanics have by now received suitable treat¬ 
ment. From a purely mathematical point of view, this field of study is 
a subdiscipline of the theory of elliptic differential operators, and 
considering the problem in the abstract one cannot expect to have self¬ 
adjointness for arbitrary choices of potentials—such as potentials more 
singular than r~ 2 at fhe origin. 

When applied >0 Schroedinger operators the general theory of elliptic 
differential operators (see Hormander [1969]) leads to other physically 
important results, such as the following theorem which we state without 
proof. 

Theorem 7*6* Any eigenvector iff of the Schroedinger operator 
in Theorem 7.5 can be represented by a function iff{r 1 r n ) that has 
continuous second derivatives in the region where the potential has 
continuous first derivatives, and it satisfies there the time-independent 
Schroedinger equation 

(7.33) |f ■ d k + [A - V(r 1 ,..., r M )]j 4>(*i ..... r n ) = 0. 

This theorem assures us that by completely solving (7.33) (as we have 
done in §7.7 of Chapter II for a particle moving in a Coulomb potential), 
we indeed obtain all the eigenvectors and eigenvalues of the Schroedinger 
operator H s , i.e., that there are no additional eigenvectors / e & Hs 

* Stummel [1956], Wienholtz [1958], Jorgens [1964], Schminke [1972], and many 
others; see also Sections X2, X4, and X5 of Reed and Simon [1975] for a review of some 
of these results. 
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that we might not have reached through the differential equation (7.33) 
because they would not have been representable by differentiable func¬ 
tions. 

7.8. The Angular Momentum Operators 

In classical mechanics angular momentum is next in importance to 
energy, position, and momentum. This marked significance of angular 
momentum observables is retained in quantum mechanics. 

Following the analogy with classical mechanics, we postulate that the 
components of the angular momentum of a single spinless particle 
along the x, y, and z axes, respectively, are represented in quantum 
mechanics by the following operators inL 2 ([R 3 ): 

L (x) D Q(y)pw — piv)Q(z)^ L(y) 2 Q(z)p(x) _ p(z)Q(y)^ 

^L ( z) D QMpiv) — p(x)Q(y) m 

The expressions on the right-hand side of these relations are obviously 
symmetric operators. Indeed, consider for example L {z) : Q {x) and P {y \ 
as well as P {x) and Q iv) , are commuting self-adjoint operators, so that 
Q(x)p(y) an( j p(x)Q(y) are self-adjoint by Theorem 2.6. Hence Qwpw — 
p(x)Q(y) j s certainly symmetric, but in view of Nelson’s counter-example,* 
we cannot claim immediately that it is self-adjoint, despite the fact that 

(7.35) W>P'''Mr) - (Q w P^)(r) = ~ ih { x Yy^ y ^ 

is well-defined on certain dense linear sets, such as ^/(IR 3 ). 

A simple way of bypassing this technical difficulty is through the use 
of Stone’s theorem (Theorem 6.1). We consider therefore rotations 
R { ^ z) of the Cartesian inertial frame of reference by an angle <f> around the 
z axis. For a fixed spatial point, such a rotation gives rise to a change 
in its coordinates (x, y, z) with respect to the original frame to coordinates 

(7.36) x' = x cos <f> + y sin <£, y* = — x sin <f> + y cos <f >, z f = z> 

with respect to the new frame. We surmise + that the configuration 
space wave function ifj( r) = y, z) will be represented in this new 
frame by 

f (*') = ftr) = *y rl = > 

* See the note following Exercise 1.2 on p. 268. A concise discussion of this counter¬ 
example can be found in Section VIII.5 of Reed and Simon [1972]. 

• Since by Born’s correspondence rule (Definition 1.3) only expectation values (1.14) 
and (1.15) and not state vectors themselves are measurable, a rigorous argument has 
to be based on group-theoretical approaches to quantum mechanics (see, e.g., Miller 
[1972], Barut and Rfcxka [1977]). 
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where, in general, if R denotes a given rotation of the coordinates system, 
1 stands for the inverse rotation, so that R _1 R = RR- 1 is the identity 
transformation [in fact, note that a rotation is an isometric transfor¬ 
mation in the real Euclidean space (IR 3 ) and as such it can be viewed as 
a unitary operator on ((R 3 )— cf. Chapter III, Exercise 4.1]. These con¬ 
siderations suggest the introduction in L 2 (U 3 ) of the linear operators 

(7.37) (U^)( r) = 

that are unitary since the transformation (7.36) is obviously linear and 
one-to-one, and its Jacobian equals one. Furthermore, it is easy to check 
that 

W;’ = <h .^R 1 . 

so that by Stone’s theorem there is a unique self-adjoint operator 
L {z) for which 

(7.38) E/<*> = exp <f> e U 1 . 

On the other hand, by Theorem 3.1, 

(7.39) = s-Km[(E/<f>./. - 0)/fl, * e® L < z) CL\W), 

and for </f(r) e < ^ 7 b 1 (IR 3 ), the outcome of taking the above limit coincides 
with (7.35) multiplied by i/it , as can be easily verified by inserting the 
expressions (7.36) for R$ z) r into (7.37) and explicitly computing the limit 
by elementary calculus techniques. 

By also considering rotations around the x and y axes we arrive in a 
similar manner at definitions of L {x) and L {y) as self-adjoint operators 
that are the extensions of the symmetric operators appearing on the 
right-hand side of the relations in (7.34). The so-obtained operators, 
however, do not commute, and by (6.6) (see Axiom W3) we easily obtain 

(7.40) [L<*>, L {y) ] C «L<*>, [L<*>, L<*>] C ihD*\ L<*>] C ihU y \ 

Thus in defining the square of the angular momentum operator by 
analogy with classical mechanics conventions, 

(7.41) L 2 = (L (£C) ) 2 + (L (y) ) 2 + (L (2) ) 2 , 

we gain run into the technical problem of establishing self-adjointness 
—as opposed to mere symmetry properties. 

General (Casimir operator) techniques for dealing with problems 
of this nature are made available by the group theoretical approach 
to symmetry problems in quantum mechanics. These techniques are, 
however, beyond the scope of this book, but fortunately we can handle 
this particular case by elementary techniques based on the results of 
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§7.6 in Chapter II. Indeed, according to those results, L 2 {Q V , /x p ) = 
L 2 (Q T , fji T ) (x)L 2 (£? 8 , and (Chapter II, Theorem 7.1) the spherical 
harmonics F z m constitute an orthonormal basis in L 2 (D 8 , /z 8 ). Let us 
therefore set by definition 

(7.42) (L 2 0)M,^) 

= ** f Ki + i)I Yr(e, f 0*(r, e\ f) <!>') f) 

whenever the infinite series on the right-hand side of (7.42) converges 
in the mean to an element of L 2 (Q p , ^t p ). The resulting operator is self- 
adjoint (see Chapter III, Theorem 5.2) and by Theorem 7.7 in Chapter II 
its action on 6, cf>) e W h 2 (& v ) (x) ^ p 2 (.Q 8 ) coincides with that of the 
differential operator form obtained by inserting (7.34) into (7.41). 

One of the useful by-products of these considerations is the joint 
spectral measure £Iouli,l ( z) of [ Q |, | L |, andL (0) , which for B = B ± x 
B 2 X B%g is defined by 

(7.43) (£|Q|,|L| x Bj x B z mr, 6, <f>) 

= x B y) i i m^) 

ftleB 2 fimeB 3 

X f 4,*(r,0',<l>')Yr(0',<!>') dr s(0',f) 

J n s 

where, naturally, l e (0, 1,...}, me +/}. Clearly, the existence of 

this measure extablishes that (| Q |, | L | ,LJ constitutes a complete set 
of observables inL 2 (£? p , /z p ). We note that, strictly speaking, this measure 
corresponds in fact to (| Q \ y L' y L z } y where 

(7.44) L' = M—1 +0 + 4L 2 ) 1 / 2 ], 

but in accordance with a tradition dating back to old quantum theory 
(see Messiah [1962]) the variable fil is thought of as being related directly 
to | L | = (L 2 )*, albeit the eigenvalues of L 2 actually equal fi 2 l(l + !)• 
An even more frequently used complete set of observables is 
(| P |, | L |, LJ. If f(k y 0 y (f)) equals the Fourier-Plancherel transform 
of ^A(r) expressed in spherical coordinates, then 

(£ |P|. |L| ,z» (Bi x b 2 X B 3 W)~(k, 6,</>) 

- x B (k) I £ 4>) 

MgB 2 flmeB n 

X f I O’, </>’) <f>') sin O' dO' 


(7.45) 
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In the presence of spin the operators (7.34) retain the meaning of 
orbital angular momentum components, but spin components have to 
be included to arrive at total angular momentum operators. Naturally, 
as seen in the discussion in Section 7.1, in that case S {z) has to be added 
to the sets consisting of | Q | (or | P |), | L |, and L iz) to arrive at complete 
sets of observables. 

**7.9. Time-Dependent Hamiltonians 

In §3 we had formulated the Schroedinger picture for the general case 
of time-dependent Hamiltonians H(t). Such time-dependence would 
be encountered only if the system under consideration is not isolated 
from its environment, so that energy is being exchanged between the 
system and some external field sources. Naturally, as mentioned in the 
footnote on p. 292, by including those sources as part of the system we 
arrive at a new, enlarged system whose Hamiltonian is not time-depen¬ 
dent, but the resulting mathematical model might be either too difficult 
to handle exactly, or an appropiate mathematical description might not 
even be available within the nonrelativistic context studied in this volume. 
A typical instance of the second kind is provided by a charged particle 
(electron, proton, ion, etc.) moving in a laser cavity and therefore con¬ 
tinuously bombarded by photons. The exact mathematical description 
of the swarm of photons filling the cavity requires the quantization of 
the electromagnetic field within the cavity and therefore comes under 
the heading of (relativistic) quantum field theory. However, if the particle 
moves at nonrelativistic speeds in relation to the cavity, the time-depen- 
dent (nonrelativistic) Hamiltonian 

(7.46) H(t) D (2m)" 1 [-ih V - (e/c) A(r, t )] 2 + eA 0 ( r, t) 

for a particle of charge e in an external four-potential ( A 0 , A) related 
to the electromagnetic field in the cavity might represent a reasonable 
approximation. Rellich’s theorem (Lemma 7.2) provides a suitable tool 
for establishing the self-adjointness of (7.46) under reasonable conditions 
on the functions A 0 , A (x) , A (v) , and A (z) . However, we are still faced 
with the problem of defining the time-evolution operator (see p. 292) 

(7.47) U(t, g = T exp [-(*/*) f * H(t) dr] 

by giving a suitable meaning to the above integration symbol, which in 
physical literature bears the name of time-ordered integral . Indeed, 
since i/(f x ) does not in general commute with H(t 2 ) for t x t 2 , we 



7. The Formalism of Wave Mechanics 


373 


have to exercise caution in preserving the appropriate time-ordering as 
we (heuristically) combine “infinitesimal” contributions in the following 
manner, 


(7.48) Ufa + dt , * 0 ) = exp[— {ijh) H(t x ) dt] Ufa , * 0 ), 

to arrive at a sensible definition of U(t , t 0 ) which satisfies (3.16)—(3.18). 

The clue for a general definition of (7.47) is provided by (3.17) 
which, if valid, by mathematical induction leads to the conclusion that 

(7.49) Ufa t 0 ) = Ufa t n ) Ufa , t n _j) ••• Ufa , Ufa , t 0 ) 
regardless of how small 

8 = max{| t — t n |,| tj — ^ | :j = 1,..., n} 
is chosen to be. Hence we set, by definition, 

(7.50) U(t, t 0 ) = s-lim n exp[—(i/h) H^t, - 

5->+0 i= n 

where the order j = n, n — 1 ,..., 1 in the above operator product is 
very essential. Under suitable conditions on H(t) [such as the existence 
of a common domain of definition for all H(t)], it can be shown not only 
that the strong limit (7.50) does exist, but also that Axiom S.3 on p. 292 
is indeed satisfied. 

In Chapter V, we apply the framework of this chapter to quantum 
scattering problems and we shall concentrate on the case of time- 
independent Hamiltonians, for which it is easily seen that (7.50) indeed 
equals (3.2). Hence we direct the reader interested in details of the 
general theory of time-ordered integrals (7.47) to Chapter XIV, Section 4, 
of Yosida [1974], and for details on the general scattering theory for 
time-dependent Hamiltonians extrapolating that of Chapter V to the 
articles by Prugovecki and Tip [1974]. 


Exercises 

7*L Let and ^ n " be Hilbert spaces. Prove that 

the mapping of 
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into 


z=l j=*l 

which takes (©“ 1 //) ® (®“=i//') into ©<li ©“_ x (// ®//), is a 
unitary transformation. 

7*2* Generalize and prove the statement contained in Exercise 7.1 to 
the cases when (a) m = + oo, n < + oo; (b) m < + oo, n = + oo; 
(c) m = n = +oo. 

7*3* Let £+ be a unitary transformation of the Hilbert space 3^ k ' 
onto the Hilbert space 3^ k for k — 1,2,.... Show that the linear 
transformation U = U 1 © U 2 © ••• of 3^ = into = 

©fc=i ^ unitary, where U is defined by 

U(A ©/■©•••) = c/xA © u 2 f 2 © A e e *5,.... 

7*4* Prove that the unitary transformation U of the Hilbert space 
(7.5) onto the Hilbert space (7.9), defined in the text, maps the subspace 
of all symmetric (antisymmetric) functions from in (7.5) onto the 
subspace of all symmetric (antisymmetric) functions from 3^ in (7.9); 
more generally, it maps any subspace of “appropriately’ ’ symmetrized 
functions in (7.5) onto the subspace of functions in (7.9) “appropriately” 
symmetrized in the same way (see Definition 5.1). 

7*5* Derive from Axioms W1-W3 the result that{P (:c) , P (v \ P {z) , S n } 
is a complete set of observables. 

7*6* Show that if ^4** exists, then ^4*** = A *. 

7*7* Show that the Fourier transform of a function 

P( r i >•••> r «) ex P [- ^ (V H-b r« 2 )] 

is a function of the form P^pip % ) exp[— + ••• + p n 2 \. 

7*8* Show that if V{r) is locally square integrable and bounded at 
infinity, then 

v ( r i - r i)f( r i . r 2 )> /(ri. r 2 ) = P(r t , r 2 ) exp [- ^ (r t 2 + r 2 2 )] 

is square integrable on (R 6 . 

7*9* Prove that the range of 1 + A, where || A || < 1, is the entire 
Hilbert space Jf, and that (1 + A)- 1 = u-lim^oo (—1)M*. 
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*8* Completely Continuous Operators 
and Statistical Operators 

8.1. Completely Continuous Operators 

A compact set in a complete metric space—in particular, in a Hilbert 
space—is a set having the property that any infinite sequence of elements 
from that set has a convergent subsequence. Linear operators on Banach 
or Hilbert spaces which map bounded sets into compact sets are called 
completely continuous or compact. The following equivalent definition 
of completely continuous operators emphasizes their main feature which 
is of importance to us. 

Definition 8*1 ♦ The linear operator A on the Hilbert space is 
said to be completely continuous (or compact) if for every infinite sequence 
fi ,/ 2 £ J?, such that \\f n || ^ C for all n = 1, 2,..., the sequence 

Af x , Af 2 ,... contains a strongly convergent subsequence. 

We shall see later that the density operator p describing states in 
statistical quantum mechanics, as well as the product pA 0 , where A 0 is 
any bounded observable, are completely continuous operators. The 
theory of completely continuous operators has many applications, some 
of which we shall encounter in Chapter V. For this reason, the 
foundations of this theory deserve special attention. 

Theorem 8*1 ♦ Completely continuous operators have the following 
properties: 

(a) They are bounded. 

(b) The products AB and BA of a completely continuous operator A 
and any bounded operator B on 3^ are completely continuous operators. 

(c) The linear combination a x A x + a 2 A 2 of two completely continuous 
operators is a completely continuous operator. 

Proof . (a) Suppose that A is not bounded, i.e., that there is no 

constant C such that || Af\\ < C||/|| for allIn that case we can 
find a sequence f x ,/ 2 ,... of normalized vectors, || f n || = 1, such that 
|| Af x || , Af 2 || ,... diverges to infinity. But then Af x , Af 2 ,... contains 
no convergent subsequence, which is impossible if A is completely 
continuous. 

(b) If / x ,/ 2 ,... is any bounded sequence, then Af x , Af 2 ,... contains a 
convergent subsequence Af ji , Af ^ ,... . Since B is bounded and therefore 
continuous (see Chapter III, Theorem 1.3), BAf ji , BAf ^ ,... converges, 
i.e., BA is completely continuous. Further, Bf x , Bf 2 ,... is bounded. 
Therefore, there must be a subsequence Bf k% , Bf k ^ ,... such that 
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A(Bf ki ), A(Bf k ^y... is convergent. Consequently, AB is also completely 
continuous. 

(c) If g ly g 2 >••• is any bounded sequence, there is a subsequence 
g tl ,gi 2 .... such that is convergent. Moreover, since g tl ,g tt 

is also bounded, it must contain a subsequence g ki , g k% such that 
A 2 g kl > Azgk 2 >••• is convergent. Consequently, the sequence a x A x g k ^ + 
a 2 A 2 g ki ,... is convergent, which shows that a 1 A 1 + a 2 A 2 is completely 
continuous. Q. E. D. 

Theorem 8*2* If the linear operator A on is the uniform limit 
of a sequence A x , A 2 ,... of completely continuous operators, then A is 
completely continuous. 

Proof '. Let f x ,/ 2 ,... be any bounded sequence, \\f n || < C, of vectors 
from Construct a family of subsequences 

of/ x ,/ 2 ,... by the recursive procedure in which, for every i — 0, 1,2,..., 
{/n +1) } is a subsequence of {/^} such that {^+i/n +1> } is convergent. 
Since {/j^} is a subsequence of {f n } and therefore bounded, {fn +1) } 
exists, due to the complete continuity of A i+1 . 

From the inequality 

|| Af}*'-Af™ || 

< II Af™ - AJ}« II + II AJ}» - AJ™ II + II AJj* - Af™ II 

< M - A Jl (||/«> II + II/<*> II) + II AJJM -/<*>)|| 
<2C\\A-AJ+\\AJJ«'-f™% 

we infer that {Af^} is convergent; namely, for any given € > 0 we have 
2C\\A- A n || < e/2 for sufficiently large n, while 

if sufficiently large *, k ^ n are chosen. Q.E.D. 

There are many continuous linear operators on an infinite-dimensional 
Hilbert space which are not completely continuous. A conspicuous 
example is the identity operator on (see Exercise 8.1). 

As a generalization of the operator | / > </1, let us define for any two 
vectors/, g e the operator | / ) <g | in the following way: 

(8.1) \f><g\f' = <*!/'>/, f'e*. 
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Lemma 8*1 ♦ Every operator of the form A = |/> (g | is completely 
continuous. 

Proof. In fact, if f x ,/ 2 is a bounded sequence, <g |/ x >, <g |/ 2 >,... 
is obviously a bounded sequence of numbers. Hence, the set {< g | / n >} 
has at least one accumulation point a. Thus, we can choose a subsequence 
{<£ | f kn >} converging to a. Consequently 


s-lim Af kn = film (g \f k } = ocf 

n-*ao n n-*oo ” 

exists. Hence, A = |/> (g | is completely continuous. Q.E.D. 

It follows from the above result and Theorem 8.1(c) that any operator 
of finite rank> i.e., an operator that can be reduced to the form 

( g - 2 ) t K I <> < e fc I = Z K>4<**l > 

fc=l k =1 

is a completely continuous operator. Moreover, by Theorem 8.2, any 
operator which is the uniform limit of operators of the form (8.2) is 
completely continuous. We shall prove that, conversely, any completely 
continuous operator is such a uniform limit. To prove this result we need 
two additional results which will be derived as part of Theorem 8.3 and 
in Lemma 8.2. 

It should be evident by now that all linear operators on a finite¬ 
dimensional Hilbert space are completely continuous. In fact, if A 
is a linear operator on and {e x ,..., e n } is an orthornormal basis in 
then it is easily seen that 

A = t I A fc > <«* |, 

1 

where h k = Ae k ,k = 1,..., n. In view of Lemma 8.1, we have established 
that A is completely continuous. 

Theorem 83* Every symmetric and completely continuous 
operator A has a pure point spectrum S A = Sp, which has no accumula¬ 
tion points, with the possible exception of zero. The characteristic 
subspace of every nonzero eigenvalue is finite dimensional. 

Proof. Assume that a ^ 0 is an accumulation point of the point 
spectrum of A. This would imply that there is an infinite sequence 
Aj , A a of distinct eigenvalues of A such that | A x |, | A 2 |,... > J | a |. If 
f x , / 2 are the corresponding unit eigenvectors 

Afn ^ A nfn » W = 1 , 2 ,..., 
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the sequence Af x , Af 2 ,... has no convergent subsequence since for any 
m n we have X m ^ X n , which implies that </ m | / n ) — 0 and 

II Af m — Af n |1 2 = | A m | 2 + | A w | 2 > 0,2/2. 

Thus, A would not be completely continuous. Hence, Sp has no nonzero 
accumulation points. 

Assume now that the continuous spectrum Sp of A is not empty, so 
that contains infinitely many points (see Exercise 8.2). Choose an 
infinite sequence of nonzero A/, A 2 ',... e Sp such that | A/ I < I A 2 ' | < 
and positive numbers r x , r 2 ,... such that the intervals I n , n = 1, 2,... 
between the points X n ' — r n and X n ' + r n are disjoint and do not contain 
the origin. Since A/, A 2 ',... are points of the spectrum of A , the projectors 
E A (I n ) are nonzero. Therefore, there are vectors^ , g 2 ,... satisfying 

E A (I n )g n =g n , ||£„|| = 1, n = 1,2,.... 

Due to the disjointness of the intervals I n , E A (I m ) E A (I n ) = 0 when 
m ^ n. Using the spectral theorem, it is easy to compute that 

<dg m I Ag n } = <g m | A 2 g n y 

= f M<gm\E/ gn > 

In 

( = 0 for m ^ n 

\ > (I A w ' | — r n ) 2 for m = n. 


Thus, we have for m ^ n 

II Ag m - Ag n ||* = || Ag m || 2 + || Ag n || 2 > 2(| A/ | - r,) 2 , 

i.e., Ag x , Ag 2 ,... contains no convergent subsequence. Hence, if A is 
completely continuous, its continuous spectrum must be empty. 

If M is a characteristic space of A corresponding to the eigenvalue A, 
then Af = Xf for all/e M. Consequently, if A ^ 0, A is a multiple of the 
identity operator when restricted to M. Therefore, M has to be finite 
dimensional if A is completely continuous (see Exercise 8.1). Q.E.D. 

Lemma 8*2* Every completely continuous operator A on can be 
written in the form 


A = VK, K= VA*A, 

where if is a positive-definite completely continuous operator, and V is an 
isometric operator defined on the range of K. 
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Proof. Since A is completely continuous and A* is continuous (see 
Chapter III, Theorem 2.7), A*A is completely continuous. Hence, A*A, 
and consequently also K = \/A*A, have a pure point spectrum, which 
has no accumulation points, with the possible exception of zero (see 
Theorem 8.3). This implies that if is a completely continuous operator 
(see Exercise 8.3). 

Let us define now the operator V on the range & K of K by setting for 
every g e & K 

Vg = Af, g = Kfe® K . 

The operator V is uniquely defined for every g e & K . In fact, if g = Kf = 
Kfi for / ^/i,we have 

II Af - Af, || 2 = </-/, | A*A(f-f 1 )> = </-/i I K\f -/0> = 0, 

which implies that Af = Af, . 

It is easy to check that V is linear. From the relations 

II Vg || 2 = <Af\Af> = <f\A*Afy = </| Kf} 

= <Kf\Kf)=\\g\\\ ge@ K , 

we infer that V is isometric. Q.E.D. 

We are now ready to investigate the relation of completely continuous 
operators in general to operators of the form |/> |. 

Theorem 8*4* An operator A on any infinitely dimensional Hilbert 
space is completely continuous if and only if there are orthonormal 
systems {e x , e 2 ,...} and {e/, e 2 ',—} in and positive numbers 
Aj , A 2 ,.*• —* 0, such that A can be reduced to the canonical form 

(8-3) ,4 =]>>*'> A* 0*|, 

k 

where, in case that the sum is infinite, we define 

(8-4) X | e k ') A* Oft I = u-lim £ | e k / A* <e k \ , 

£i "~* +c0 sti 

and the set {A /c } consists of all the eigenvalues of y/A*A. 

Proof. According to Lemma 8.1 and Theorem 8.1(c), an operator of 
the form 

X I e k'y A k ( e k I 

k*»l 


(8.5) 
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is completely continuous. Hence, by Theorem 8.2, any operator which 
is the uniform limit of operators (8.5) is also completely continuous. 

Conversely, assume that A is a completely continuous operator. 
According to Lemma 8.2, A = VK , K = \/A*A, where K is positive 
definite and completely continuous. Hence, in view of Theorem 8.3 and 
Theorem 6.4 in Chapter III the spectral resolution of K is 

(8.6) K= f \dEf = Z i X/E Mj , 

J U 1 j 

where A/, A 2 ',... are the eigenvalues of K; in addition, if K has infinitely 
many eigenvalues, the convergence of the series in (8.6) is uniform. By 
selecting an orthonormal basis in each of the characteristic subspaces M ; - 
(which are finite dimensional by Theorem 8.3), we arrive at an ortho¬ 
normal system {e x , e 2 ,...} in 

Let us set X k = A/ if e k e . Assume that we are dealing with the 
nontrivial case when K has infinitely many nonzero eigenvalues 

n oo 

K = u-lim £ A fc | e k ) <e fc | = £ | e k ) X k <e k \ . 

fc_i k-i 


Since V is isometric, 

(8.7) A = VK = u-lim £ A, F | e*> <e, | . 

n-> oo , ^ 

Setting e k ' = Ve k , we can easily check that 

(Y I **> <** I)/ = V«eic I/> e k ) = <e k |/> e k , 

i.e., V | e k y(e k \ = \ e k y(e k |. Inserting this result in (8.7), we arrive at 
(8.3). Moreover, the isometry of V implies that {e/, is an ortho¬ 

normal system since {e x , e 2 ,...} is orthonormal. Q.E.D. 

8.2. The Trace of a Linear Operator 

In order to be able to study systematically different classes of 
completely continuous operators, we need the concept of trace. 

Definition 8*2* A linear operator A defined on the separable 
Hilbert space is said to be of the trace class if the series X k (e k \ Ae k y 
converges and has the same value in any orthonormal basis {^} of 
The sum 

( 8 . 8 ) TrA=Z<e k \Ae k ) 

Tc 


is called the trace of A. 
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We shall investigate the existence of the above trace in the context of 
future theorems. But first we would like to remark that every linear 
operator A on an w-dimensional Hilbert space has a trace; namely, if 
{ e i >•••> e n } an d { e ii—j e n } are any two orthonormal basis, the sum in 
(8.8) is finite and the same in both bases: 

n n 

E < e * I Ae *> = E < e * I «/><«/ I ^«/><«/ I «*> 

k =1 1 

= E < e i I Ae /> (E <«/1 «*><«» l o) 

= E <«,' l 

i=l 


We note that in the infinite-dimensional case we cannot immediately 
establish, using the same method, that the sum in (8.5) does not depend 
on the basis employed. The reason is that in dealing with multiple 
infinite sums the interchange of the order of summations is legitimate 
only if we know that the convergence of the multiple sum is 
unconditional. We shall see that this is the case when A is positive 
definite , i.e., when < / | Afy ^ 0 for all /e 


Lemma 83* If A = K*K , where K is a linear operator defined 
on the infinite-dimensional separable Hilbert space then the series 


(8.9) 


E 


l 


00 


<e k \Ae k y = E \\Ke k ||« 
1 


has the same sum in all orthonormal bases {e x , e 2 ,...} on \ thus, if that 
sum is finite, Tr A exists. Moreover, A is bounded and || A || ^ Tr A. 


Proof. Let {e x , e 2 ,...} and {ef, e 2 \...} be any two orthonormal bases 
in 3%*. The double series 


( 8 . 10 ) 


E 




\<e t ' I Ke k }| 2 


18 a series with positive terms; it converges if and only if the three sums 
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in (8.11) are equal (see, e.g., Randolph [1968, p. 162, Theorem 3]), 
(8.11) £ (£ |<e/ | Ke k y |*) = £ ( £ |<e/ | Ke k }|») 

<=1 1 fe=l X 7 


= £ |<*/ I 

< f fe=« 1 

According to ParsevaPs relation [(4.15) of Chapter I], 


Z |<*/ I 1&*>| 2 = Z <**«/ I **><** I K%'} = || | 


X |<«,' I 1&*>| 2 = Z <1&* I 0<*/ I Ke k y = || Ke k || 2 . 

i=l i=l 

Hence, from (8.11), (8.9) converges if and only if (8.10) converges, and 

Z II Ke « II 2 = I II K **i II 2 = Z l« I K **i> I 2 

1 i =1 ij=* 1 

= £ \W I Ke/>\' = £ || Ke/ ||», 

i,j =1 i=l 

thus proving that Tr A exists. 

By using the Schwarz-Cauchy inequality, which yields \(^Ke i | Ke k y\ ^ 
II Ke t || || Ke k ||, we get 


(Tr A) 2 = (£ II Ke k || 2 ^ = £ \\ Ke t \\ 2 \\ Ke k 


i,k= 1 


> Z \<Ke { I Ke k) \* = £ l<«< I ^«*>l 2 


i,k=l 


i,k=l 


= £ ( £ <4e fc I *><* I = £ II Ae k ||», 

fc=l ' i= 1 k= 1 

since the interchanges of the order of summation are permissible on 
account of the unconditional convergence of a series with positive terms. 
From the above we obtain 

(8.12) MeJ 2 < (Tr A) 2 . 

Given any nonzero vector /e<^, we can adopt //||/|| to be the first 
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element e x of an orthonormal series, and, therefore, we have by (8.12) 

M/|| <(Tr J)||/||. 

The above inequality shows that A is bounded and \\ A || ^ Tr A. 

Q.E.D. 


8.3. Hilbert-Schmidt Operators 

In Chapter V we shall encounter integral operators which belong to a 
special class of completely continuous operators called the Hilbert- 
Schmidt class. 

Definition 83* A completely continuous operator, having the 
canonical decomposition (8.3), is called a Hilbert-Schmidt operator if 

(8.13) £A*«<+oo 

k 


Theorem 8*5* A bounded linear operator A defined on the 
(separable*) Hilbert space is a Hilbert-Schmidt operator if and only if 
A*A is of trace class. 

Proof. If A is a Hilbert-Schmidt operator, then by (8.3) 

(8.14) £ <** I A * Ae *> = III Ae * II 2 = I V- 

k k k 

In the case where the orthonormal system is not already a basis, 
we can complete it to an orthonormal basis in by adding to it an 
orthonormal basis {e{} in the orthogonal complement of the closed linear 
subspace [e ± , e 2 ,...] spanned by {e fc }. Since Ae? = 0, and therefore also 
(e{ | A^Aef fs ) = 0, we have by virtue of (8.14) and Lemma 8.3 that 
the trace of A*A exists and 

(8.15) Tr[A*A] 

k 

Consider now any bounded linear operator K for which Tr(-/£*if) 
exists. Since any normalized vector / e Jf 7 can be chosen as the first 
element of an orthonormal basis in we have 

II Kf \\ 2 = <f\K*Kf> < Tr[ls:*iq. 


* We give the proof only for the separable case, but the result is true in general (see 
Hchutton [I960]). 
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Thus 

(8.16) || K\\* - sup || Kf\\* < Tr [K*K\. 

ll/ii=i 

Assume that A is a bounded linear operator with Tr[A*A] < +oo, 
and let us prove that A is a Hilbert-Schmidt operator. Since the proof is 
trivial when Jf? is finite dimensional, we shall consider only the case of 
operators A on infinite-dimensional separable Hilbert spaces. 

Choose an orthonormal basis {e x , e 2 ,...} in and define the linear 
operators 


A„ = Z I Ae k y(e k \ , n = 1 , 2,... . 

k= 1 

Using (8.16) and noting that A n ej = Ae j if j ^ n, we get 
II A-A n || 2 < Tr [(A - A n r (A - A n )} 


= Y J \\(A-A n )e k \f= £ \\Ae k \ 

k= 1 Jc=n +1 


Thus, || A — A n || -> 0 when n -> oo. Since A x , A 2 ,... are completely 
continuous operators, it follows from Theorem 8.2 that A is also 
completely continuous. It is easy to reestablish (8.15) by using the earlier 
procedure, and thus prove that A is a Hilbert-Schmidt operator. 

Q.E.D. 

For any Hilbert-Schmidt operator A , the quantity 
(8.17) || A ||, = VTr [A*A] 

exists, and is called the Hilbert-Schmidt norm of A. As a matter of fact, 
using first the triangle inequality in and then the triangle inequality 
in Z 2 spaces we obtain (see also Theorem 8.10) 

II & 1 A 1 + <V^2 II 2 = (Z W( a l A l + <V^2) e Tc II 2 ) 

<(Z [IIMi^ll + IIMAll] a ) 1/a 

X k ' 

< (ZII Ma ll 2 ) 1/2 + (ZIIMA ll 2 ) 1/2 


= i^iiii4 1 ii i + i« t iiii4 i ii,. 
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Moreover, according to (8.16), 

(8.18) M|| <(Tr[J*J])^ = |M II,, 

which shows that || A || 2 = 0 implies A = 0. 

8.4. The Trace Norm and the Trace Class 

Let us consider now a completely continuous operator A, having the 
decomposition (8.3), for which 

(8.19) Tt[VA*A] = Tr W \ e k> X k <e k |] = £ 4 < +oo. 

L ]c J ]c 

If the above series is infinite, its convergence implies that X k -> 0 when 
k -> + oo. Consequently, X k 2 < X k for sufficiently large values of k. Hence, 
Zfc A fc 2 converges when converges. This shows that any completely 
continuous operator A satisfying (8.19) is a Hilbert-Schmidt operator. 

Lemma 8*4* If A is a completely continuous operator on the 
Hilbert space having the canonical decomposition (8.3), then 

(8.20) E A, = sup E \<J t I A gi y\, 

7c i 

where the supremum is taken over all choices of countable orthonormal 
systems {/«} and {&•}. 

Proof. Using (8.3) we get 

K/<l 4 ft>l = l<fi\e k '>K (e k \ gi ) 

Jc 

< E A * \<fi I oo* ift>i 

7c 

< * I MK/i I «*'>l 2 + l <gt I e k >n 

7c 

Applying Bessel’s inequality (see Chapter I, Lemma 4.1) to the ortho¬ 
normal systems {/J and { gi } we obtain 

EK/ilOl* < Ill'll 2 = l. 

II<ftl«*>l 2 <Kll a = I- 


( 8 . 21 ) 
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In proving Lemma 8.3, we used theorems on the convergence of 
double series. In view of (8.21), we can apply those theorems to justify 
the reversal of the order of summation in the following computation: 

X Kfi I Agi) I ^ i X X I e k >\ 2 + \<gi I e k)\ 2 ) 

i i Jc 

= i z A j£ (K/i I **'>l 2 + \<gi I ^>I 2 1 

Jc L i J 

<£A fc . 


Since we have 


£ 10*'| ^>1 = £A fc , 

fc k 

we conclude that (8.20) is true. Q.E.D. 

We shall now show that the functional 

(8.22) || A || x = Tr [VA*A\ = £ A, 

k 

on the class of Hilbert-Schmidt operators satisfying (8.19) is a norm, 
called the trace norm. 

Since for every series with positive terms 

£ A* 2 < (£ A*) 2 , 

k v k 

we obtain 

(8.23) II A || 2 < || A Hi . 

Hence, || A ||j_ = 0 implies that || A || 2 = 0, and consequently A = 0. 
For any complex number a , 

|| aA ||i = Tr[(| a | 2 A*A?*\ = \a\ Ti[VA*A] = \ a | \\ A ||i. 

Finally, by making use of Lemma 8.4 we obtain 
|| a x + a 2 Ik = sup £ |</ 4 \{A X + A 2 ) gi y I 

i 

< sup £ |</i I ^<>1 + sup £ |</i I Azgi )| 

i i 


= 11A Hi+ 11 A Ik. 
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Thus, || • Hj satisfies all the conditions of a norm on the class of Hilbert- 
Schmidt operator for which (8.19) is true. Theorem 8.7 states that this 
class is identical to the trace class of Definition 8.2 when is a separable 
Hilbert space. To prove it, we need the following result. 

Theorem 8*6* Any self-adjoint operator of trace class has a pure 
point spectrum. 

Proof. Any self-adjoint operator A of trace class can be written as a 
difference of two positive-definite self-adjoint operators A x and A 2 , 

A = f X dE x = A<+> - A<-> 

J u 1 

XdE A , A^ = -\ XdE x , 

[0,+oo) J (—oo,o) 

both having a finite trace (see Exercise 8.6). Consequently, it is sufficient 
to prove the theorem for positive-definite operators. 

Assume that A is positive definite, and choose any A 0 > 0. Let 
Ai < A 2 < • • • < X n be n numbers greater than A 0 such that 

(8.25) E Xi ^ E Aq , E k% ^E kv ..., E k ^ E kn _ x . 

Since E kk — E^ ^ 0, there are n normalized vectors f x ,...,/ n such that 
(E\ k — fk = fk 9 k — !>•••> n - 
According to the spectral theorem we have 

(8.26) </* I Af k y = f Xd<f k \EJ k > 

J or 

= f ^d(f k \E k f k y 
^ A 0 f d(f k | E k f k y = A 0 . 

By completing/! ,...,/ n to an orthonormal basis/! ,/ 2 ,..., in we get, 
using the positivity of A and (8.26), 


(8.24) 

A<+> = 


Tr A =£<f k \ Af k ) >£<f k \ Af k ) > «A„ . 

k k=*l 

The above inequality shows that the number of points A x X n for which 
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(8.25) is true is smaller than Tr Aj A 0 , i.e., the spectral function E x can 
change only at a finite number of points in (A 0 , -f- oo). Thus, A has 
exclusively a point spectrum in (A 0 , + oo), and since A 0 > 0 can be 
chosen arbitrarily small, A has a pure point spectrum. Q.E.D. 

Theorem 8*7* A completely continuous operator A on the Hilbert 
space ^ is of trace class if and only if it is a Hilbert-Schmidt operator 
for which 

Tr[VA*A] = £ A* < +oo, 

k 

where X k are the numbers appearing in the decomposition (8.3) of A. 

Proof. The positive numbers \ k occurring in the decomposition 
(8.3) of a Hilbert-Schmidt operator A are the eigenvalues of V A*A. 
Hence, if A* = A, we must have e k = e k when X k e S A , and e k = — e k 
when — X k e S A . In the decomposition 

A=A<+>-A<~>, A<±>= £ I e k> X k (e k \, 

±A k eS A 

the operators A {±) are positive definite. Hence, we can write 

j<±> = Va w Va^, 
and apply Lemma 8.3 to infer that 

Tr A^= £ K<^X k 

±A Jfc eS ' 4 k 

exists if ZfcAj. < +oo. Consequently 

Tr A = Tr A<+> - Tr 


also exists. 

The more general case of nonsymmetric A can be reduced to the 
above case by setting 

A = A x + iA t , A x = i(A + A*), A 2 = ~(i/2)(A - A*). 

In fact, the operators A x and A 2 are obviously symmetric Hilbert- 

Schmidt operators, for which we get 

Tr (VASA) = || A t It < || A |t + || A* |t < +oo, 

T liVAtt) = II A* Hi < II ~iA It + II iA* It < +oo. 

using the properties of the trace norm || • \\ t . 
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Consider now a self-adjoint operator A of trace class. According to 
Theorem 8.6, A has a pure point spectrum. Hence, 


(8.27) 


A= X dE x A = Yj XE a ({X}\ 

J U 1 \eS* 


where the convergence of the above series is uniform if the series is 
infinite. Each characteristic subspace M A corresponding to a nonzero 
eigenvalue A e S A has to be finite dimensional; in fact, if that were not 
true for some A ^ 0, we would have Tr[E^({A})] = +oo, which would 
imply that the trace of A does not exist. Choosing an orthonormal basis 
in each characteristic subspace M A , A e S A , we obtain, by taking the 
union of all such bases, an orthonormal basis {e x , e 2 ,...} in . Since 
each M a is finite dimensional, (8.27) yields 


(8.28) A = £ I £&> X k (e k \, 

&=i 

where X k ' is the eigenvalue of A for the eigenvector e k . 
In taking the trace of A , 


Tr A — £ < e k | Ae k y — ^ X k f , 

we can use any orthonormal basis, and therefore any order of e x , e 2 ,... 
in the above series. Hence, the series ££=1 X k converges unconditionally, 
and consequently it converges absolutely (see Randolph [1968, Section 
3.9]): 


X K < +°°j x k — | X k |. 

It follows from (8.28) that A is a completely continuous operator. 
Since the convergence of £j£=i X k implies the convergence of j^ k=1 X k 2 
(A fc 2 < X k for sufficiently large k ), we easily deduce that A is a Hilbert- 
Schmidt operator. 

If A is any operator of trace class, then obviously A* is also of trace 
class: 


Tr A* = £ <e k | A*e k > = % <e k \ Ae k >* = (Tr A)*. 

1 1 

Hence, A 1 — ^(A + A*) and A 2 = —(i/2)(A — A *) are self-adjoint 
operators of trace class, and consequently they are Hilbert-Schmidt 
operators for which || A 1 1|, < + 00 and || A 2 || x < + 00 . This implies that 
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A = A x + iA 2 is also a Hilbert-Schmidt operator. Moreover, since 
|| • Ih as defined in (8.22) is a norm, 

Tr VA*A = || A Ik < || A, || t + || |k < +oo, 

i.e., ^k converges. Q.E.D. 

8.5. Statistical Ensembles and the Process of Measurement 

When we were discussing in §1 the concept of preparatory measure¬ 
ment in quantum mechanics, we saw that for n compatible observables 
represented in the Schroedinger picture by the operators A x ,..., A n it 
is usually not the case that we can prepare at an instant * 0 n absolutely 
accurate simultaneous values Ai 0) ,..., A^ 0) for A x A n . Instead, in 
general, a preparatory measurement yields the weaker result that the 
simultaneously prepared values of A x ,..., A n are within an n-dimensional 
Borel set B. Therefore, after a preparatory measurement we have to 
assign to the system the family of all states W{t) which satisfy (1.26). This 
family contains, in general, more than one element. However, in practice, 
instead of dealing with such a family of state vectors, one prefers to 
make some reasonable statistical assumption based on some additional 
knowledge or insight which one might have regarding the ensemble of 
systems used in the experiment. 

In order to simplify the discussion from the mathematical point of 
view, let us assume that A x ,..., A n represent a complete set of observables 
with pure point spectra, and that for every A = (A x A n ) e S^ 1 X *** X S An 
we can choose a normalized vector e x such that A k e x = X k e x , 
k = 1,..., n. Consequently, the family of all vectors e x , A e S^ 1 X ••• X 
S^ n , constitutes an orthonormal basis in 

Assume that the preparatory measurement which has been carried out 
on the system at t 0 provides us with the information that the prepared 
simultaneous values of A x ,..., A n are at t Q within the Borel set B , where 
ECS^ 1 X * * * X S An . Let us further simplify the considerations by 
assuming that B is finite. Then the prepared family of states consists of 
all the states W(t) satisfying 

^(t 0 ) = e x , A eB, 

and any linear combination of such states. 

Suppose that we submit a large ensemble of identical systems to the 
described preparatory procedure, and that we know that in this ensemble 
(100 o>a)% of the systems have at time t 0 the simultaneous values 
AiA n , (A x A n ) e B, for A x ,..., A n . As we already know, the theory 
predicts that the mean value of the outcome of a determinative measure- 
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ment at time t > t 0 of an observable represented by a bounded operator 
A 0 is 

(8.29) <^ 0 >n(0 = <Ta«I W0> 

if the measurement is carried out only on that part of the ensemble 
consisting of particles which at t 0 had the simultaneous values A x X n 
for A x A n , respectively. Hence, the mean value of A 0 for the entire 
ensemble is 

(8.30) <A 0 ) eDS (<) = £ «> A <^(0I A 0 V,(t)>. 

AeB 

Let us introduce for each instant t the operator 

(8.31) p(t) = X ¥n(«) - E, n(t) = | y A (0>0P,(*)l. 

AeB 

which is called the density operator or the statistical operator of the 
considered ensemble. 

From the orthonormality property of the set of all vectors ^(J), 
A e S Al X ... x S'* 1 *, at any fixed t, it follows immediately that 

(8.32) <A 0 ) ens (t)= X <Wi A oP (t)W x (t)y. 

AeS'^ 1 X***XS'^ n 

The above relation can be written in the form 

(8.33) <^o>e ns (0 = Tr [A oP (t)] = Tr[p(t)A 0 ], 

provided that A 0 p(t) is of trace class. Theorem 8.8 shows that this is 
indeed the case. 

Theorem 8*8* Suppose {e x , e 2 ,...} is an orthonormal basis in J#*. If 

(8.34) A =£K>A fc < e *|, 

k 

where X k ^ 0 and 

(8.35) Tr [A] = £ A, < +oo, 

k 

and if A 0 is a bounded operator, then A 0 A is of trace class. 

The above theorem can be easily derived from Theorem 8.7 by 
showing that the operators (A 0 A)*A 0 A and (AA Q *A Q A) 1 t 2 are of trace 
class. As a matter of fact, it is sufficient to show that (AA 0 *A 0 Ay / 2 is of 
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trace class, since it is easy to see that in that case AA 0 *A 0 A is also of 
trace class. 

To establish that (AA 0 *A 0 A) 1 / 2 is of trace class, note that 
(AA 0 *A 0 A^ = (AA 0 *A 0 A)V*(AA 0 *Z 0 A)M 

Then it follows from Lemma 8.3 that the sum 

£ < e * \(AA 0 *A 0 Ay'* e k ) 

Tc 

has the same value in any orthonormal basis {e 1 , e 2 Let e x , e 2 be 
the vectors appearing in (8.34). We have 

<«* \(AA 0 *A 0 A) 1/2 e k > < || (AA 0 *A 0 A)v* e k || 

= «e k | AA 0 *A 0 Ae k ))^ 

= X k (e k \A 0 *A 0 e k )^^X k \\ A 0 \\, 


and consequently 

£ (e k | (AA 0 *A 0 A)v* e k > < || A 0 1| £ X k < +oo. 

Tc Tc 

Thus, (AAq^AqA) 1 / 2 is of trace class and Theorem 8.8 is established. 


8.6. The Quantum Mechanical State of an Ensemble 

Theorem 8.8 establishes that (8.32) can be written in the form (8.33) 
when A 0 is bounded, since p(t) is a positive-definite operator with a pure 
point spectrum and of finite trace: 

Tr[p(0] = I <WI Pi*) ^(0> = 1- 

A 

In the experiment described at the beginning of Section 8.5, in 
which we were concerned with a statistical description of an ensemble 
of identical independent systems, the relevant information about the 
outcome of any measurement can be obtained from the operator-valued 
function p(t); thus, it is justifiable to state that p(t) represents the state 
of the ensemble in the Schroedinger picture. 

In the light of the above discussion the following postulate is a natural 
generalization of (8.31). 
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Axiom E* Consider a statistical ensemble of identical systems, and 
let Jf 7 be the Hilbert space associated with each system in the ensemble 
(Axiom O, §2). Any state of the ensemble can be represented in the 
Schroedinger picture by an operator-valued function p(t) 

oo n 

(8-36) P {t) = £ w*£V t(< ) = u-lim £ w k E vU) , 

k ~i *ti 

where E Vk {t) = | W k (i)') (W k (t) |, 

(8.37) £ «>* = 1, 0, 

fc=l 

and {^(J), ¥ / 2 (^),.**} is an orthonormal system of state vectors whose time 
dependence is governed by the relation 

(8.38) 'F k (t) = U(t,t 0 )'F 1c (t 0 ), t,t 0 eU\ 

where U(t> t 0 ) is the evolution operator of the systems in the ensemble. 

We note that the uniform limit in (8.36) exists on account of (8.37). 
This can be seen easily if we introduce 


P = s-lim £ w k E Wk = E I ¥*> «’*<¥'* I 

fc=l fc=l 

by means of the above strong limit, which obviously defines a linear 
operator. Since 


P ~ E W kE'i' k 




E w k I 

k=n +1 


E 




OO 


E w k 

k=n +1 


converges to zero when n-> + oo, p is the uniform limit of the partial 
sums in (8.36). 

An ensemble in the state (8.36) is viewed as consisting of systems in 
the states ^(f), W 2 (t),... in the respective proportions w 1 , w 2 ,... . If 
w k = 1 for k = k 0 and therefore w k = 0 for k ^ k 0 , the described state 
of the ensemble is one in which each of the systems is in the state W k ( t ). 
In that case p(t) is called a pure state of the ensemble. If a few of the w k 
arc nonzero, p(t) is called a mixed state or a mixture of all the single¬ 
system states x ¥ k {t) for which w k > 0. 

It is easy to derive from (8.36) and (8.38) 


( 8 . 39 ) 


KO r/(Mo)p(*o) u-\t.t «). 
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To see that, take any /e and note that for any k = 1,2,... 

= W)l/> 

= c/(/, * 0 )w 0 )l c/-i(<, <„)/> yyg 

= U(t, t 0 ) Ey M U-\t, t 0 )f. 


which shows that 

E^it) = U(t, t 0 ) Ey^) U-\t, 1 0 ). 
From the above relation we get 


n 


X W kE^ k (t) — 

Tc =1 


E/(Mo)(Z 




t 0 ). 


In the limit n —► + oo, we obtain (8.39) by using the basic rules for dealing 
with uniform limits (see Exercise 8.7). 

In view of (8.33) and of Born’s correspondence rule in Definition 1.3 
it is natural to assume that if a determinative measurement of a bounded 
observable A 0 is carried out at time t on each system in an ensemble in a 
state p(t), then the mean value of A 0 will be Tr[p(£) A 0 ]. Thus, as a 
generalization of (1.27), we can state that the ratio of the number 
°f systems having at t simultaneous values of A 1 ,..., A n 
within the set B , to the total number N of systems in the ensemble, 
should be equal to 

(8.40) ^ Tr[pW e a, . A n{B)] = pA,....A n{B) 


If a preparatory procedure on the systems of an ensemble does not 
leave the ensemble in a pure state, i.e., if there are more than one linearly 
independent state vectors satisfying (1.26) in the preparatory measure¬ 
ment of a general type described in Definition 1.4, then usually the 
principle of equal a priori probabilities is invoked in order to assign to the 
ensemble a state. According to this principle, the proposed state (8.36) 
of the ensemble is the one in which equal nonzero weights are assigned to 
the single-system states x P k (t) satisfying (1.26), and zero weights to those 
states not satisfying (1.26). Thus, if ¥ / 1 (^),..., W n (t) are all the mutually 
orthogonal states satisfying (1.26), i.e., those states for which 


”(B)v k (t 0 )y = l, 


Wo) I E Al 


A, 
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or equivalently 


E A ' . \B) W k (t 0 ) = W k (t 0 ), 


then obviously 


P( t o)=liE*M=l t E A ' . An (B). 

Theorem 8.9 provides a simple algebraic criterion for distinguishing 
between pure states and mixed states of a statistical ensemble. 

Theorem 8*9* The state p(t) of an ensemble is a pure state if and 
only if p(t) = p\t). 

Proof ’ If the ensemble is in a pure state in which the states of each 
system in the ensemble is W(t ), the statistical operator p(t) is at each 
instant t the projector onto W(t ), and therefore 

(8.41) P \t) = P (t). 

Conversely, assume that (8.41) is true. If we write p{t) in the form (8.36) 
and compare it with the following expression for p 2 (t) (see Chapter III, 
Exercise 5.10) 


, n >2 

P 2 W = u-lirn y t (t>) 

n oo 

= u-lim £ w k 2 E Vk ( t ) = Y 

n ^°° fc-1 fc=l 

we obtain w k 2 = w k . Thus, w k can assume only the values 0 and 1. This, 
in conjuction with the condition w k ~ shows that w k = 0 except 
for one value k 0 of k at which = 1. Q.E.D. 

We have arrived at the concept of “state vector in the Schroedinger 
picture” by considering the possible values of all states ¥*(£) of the system 
at a given instant. In this same manner we arrive at the concept of 
statistical operators by considering the possible values of a state p(t) of an 
ensemble at instant t. 

Definition 8*4* Let be a separable Hilbert space associated with 
a quantum mechanical system. A linear operator p on is a statistical 
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operator ( density operator) if it can be written in the form 

n oo 

(8.42) p = u-lim £ w k E Wk = £ w k E Vk , 

Tc =1 fc=l 

where the real numbers w x , w 2 ,... satisfy (8.37) and , W 2 ,...} is an 
orthonormal system of state vectors in Jf. 

Any statistical operator p is obviously a positive definite self-adjoint 
operator. Hence, according to Lemma 8.3, Tr p exists if the sum 
Zi < e i I P e i ) is finite in some orthonormal basis {e x , e 2 ,...}. We can 
obtain such a basis by completing , W 2 ,...} to an orthonormal basis 
in Thus, we have 

(8.43) Tr P = £ <W k \ = f>» = 1. 

fc=l fc=l 

Conversely, Theorem 8.7 tells us that in the absence of superselection 
rules any positive self-adjoint operator of unit trace is a density operator. 


8.7. The von Neumann Equation in Liouville Space 


Let us concentrate now on the case of time-independent Hamiltonians 
H; i.e., in physical terms, on the case where each element of the ensemble, 
whose density operator is described (in accordance with Axiom E) 
by p(t), is isolated from its environment so that its total energy is con¬ 
served. In that case U(t , £ 0 ) in (8.39) is given by (3.2). Differentiating 
the right-hand side of (8.39) formally, we arrive at the von Neumann 
equation : 


(8.44) 


= ih h-lim 


At)-p{t) 


dt 


At 


= [H, P (t)]. 


This equation bears a remarkable formal resemblance to the Heisen¬ 
berg equation (3.25), but there are both mathematical and physical 
distinctions: in (3.25) A(t) is in general an unbounded self-adjoint 
operator representing an observable in the Heisenberg picture, whereas 
in (8.44) p(t) is a (self-adjoint) trace-class operator of unit trace and of 
bound 

MOII = su p w k < i> 

fc-1,2,... 

and this p(t) represents a (mixed or pure) state in the Schroedinger 
picture. Hence we have to study the precise meaning of the limit on the 
left-hand side and of the commutator on the right-hand side of (8.44), 
respectively, on their own rather than as corollaries to the results of §3.4. 
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Such a study is greatly facilitated by the fact that the family jSf(Jf') 
of all Hilbert-Schmidt operators on (called in mathematical literature 
the Hilbert-Schmidt class* and in physical literature the Liouville 
space) is a Hilbert space, and that as a trace-class operator p(t) belongs 
to by Theorem 8.7. 

Theorem 8*10* The family of all Hilbert-Schmidt operators 

on a given (separable) Hilbert space is itself a (separable) Hilbert 
space (called the Liouville space over 3?) with respect to the inner product 

(8.45) (A, | A 2 ) 2 = Tr(A 1 *A 2 ), A ly A 2 e 

Proof ’. The argument at the end of §8.3, which establishes the fact 
that (8.17) is a norm, simultaneously establishes that ) is a linear 
space. Hence, if A ly A 2 g J£?(^ f) then A 1 + A 2 , A x + iA 2 e JSf(^f), 
and therefore 

(8.46) A*A 2 = i(A, + A 2 )*(A, + A 2 ) + {2i)~\A x + iA t )*(A ± + iA 2 ) 

-|(1 -i)(A^A 1 + A 2 ^A 2 ) 

is of trace-class by Theorem 8.5, so that (8.45) exists. The linearity 
features of the trace operation, 

(8.47) Tr faAM + a 2 A < 2 >) = a x Tr A™ + a 2 Tr A < 2 > 

that is evident from + (8.8), immediately leads to the conclusion that 
(8.45) has the properties (2)-(4) in Definition 2.1 of Chapter I, whereas 
property (1) follows from (8.18). 

To establish that the Euclidean space is complete and separable 

if is a separable infinite dimensional Hilbert space,* we introduce 


# In Definition 8.2 we had restricted ourselves to separable Hilbert spaces, which 
are of exclusive interest in quantum mechanics, but that definition can be extended to 
the nonseparable case (see Schatten [I960]). 

• The trace-class is often denoted by the Hilbert-Schmidt class by 

the class of compact operators by ^oo(^), the C # -algebra of Theorem 1.1 in 

Chapter III by We then have C C • • • C C where 

.'^(Jf'), and ^ , 00 (<# 5 ’) are Banach spaces (and, moreover, so-called minimal norm 

ideals in ss ^l(.^f’)) with respect to the trace-norm || • || x , Hilbert-Schmidt norm 

|| ■ ||u , and operator bound || • ||, respectively). However, among all ^ , z (^f > ), i — 1, 2,..., go 
(where appropriate classes and norms are defined for i = 3, 4,...), only ^ , 2 (<*f’) = 
is a Hilbert space (see Schatten [I960] for details). 

* Set* Schatten [I960] for the ease when M' is nonseparable. 
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an orthonormal basis {e x , e 2 ,...} in - '//, and note that for any 
A lt A a e 2[X) 

00 00 00 

(8.48) (A, | A 2 ) 2 = X <e k | A 1 *A 2 e k ) = X X <«, | ^ 1 e 4 >*<e i | A 2 e k ). 

Consequently, the mapping assigning to As the element 

{{<i e i | Ae k )} ifk in the separable Hilbert space / 2 (oo 2 ) = / 2 (oo) 0 / 2 (oo) 
(see Theorem 4.7 in Chapter I and Theorems 6.9 and 6.10 in Chapter II) 
of matrices { a ik } with inner product 


««}> 


La ik tk 


is an isometric transformation of into / 2 (oo 2 ). To see that this 

transformation is actually unitary note that by Theorem 8.5 As 
if and only if 

00 

(8.49) X |<>i I Ae k }\ 2 = Tr(A*A) < oo. 

i,k=l 

Thus is isomorphic to / 2 (oo 2 ), and therefore it is a separable 

Hilbert space. Q.E.D. 

In physical literature operators acting on a Liouville space 
are called superoperators (whereas in mathematical literature either the 
term operator is retained, since after all is a Hilbert space itself, 

or the term transformer might be applied). As examples of super¬ 
operators acting on A g ^(J^) consider 

(8.50) U A = UAU *, U* = U-\ 

(8.51) HA = hr\HA - AH ), H = H*. 

The superoperator U has domain = jSfpf*) and is unitary. Indeed 
by (8.46) and (8.11) the infinite double series in (8.48) converges uncon¬ 
ditionally, so that by setting A = A£ and B = A 2 we conclude that 


(8.52) 


Tr (AB) = £ O* I ABe k) = X Z <A I Ae iX e i I Be k> 

1 fc=l «= 1 


00 00 00 

= X X < c < I Be *><e k I Ae «> = X <«< I BA X) = Tr(BA) 
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for arbitrary A> B e Therefore, 

<U A | U A\ = Tr [U(A*AU*)] = Tr[( A * Jt/*)t/] = < A \ A\ , 

and since obviously U ~ X A = U*AU , we also have that 
which establishes the unitarity of U. 

Similarly, assuming that in (8.51) H is some bouunded self-adjoint 
operator with S) H = (8.52) yields 

*<A X 1 1IA 2 \ = Tr[A 1 *{HA 2 - A 2 H)] 

= Tr[(HAJ*A 2 - HA x *A 2 ] = h<HA 1 \ A 2 \ , 

so that H is a self-adjoint superoperator on <jS?(^f). On the other hand, 
if H is unbounded we run into technical problems with domains of 
definition, so that we have to be more precise as to the meaning of H 
in (8.51). The following result illucidates, however, the relationship 
between the domains, spectra, and spectral measures of the operator H 
on 3/F and the superoperator H on ^(Jf). 

Lemma 8*5* If H is any self-adjoint operator in Jf’, then the super¬ 
operator H acting in ) as follows 

(8.53) HA = hr\H, J]** = h~\HA - AH) 

and with domain Sf H consisting of all A e for which HA , AH e 

jSf(^f), is self-adjoint in the Liouville space furthermore, 


S H 


A' - A" 


: A', A" e S* 


S H 


A' - A" 


: A', A"eS p " 


and the action of the spectral function E A H of the superoperator H 
on any A e can be expressed in terms of the spectral function E X H 
of the operator H by means of the following spectral integral* 

/* -j-oo 

(8-54) EfaA = I E" U A d u E, 

oo 

If we denote by h-lim the limit in the Hilbert-Schmidt norm on ^(Jf*) 
(so that A = h-lim A n means that || A — A n || 2 -> 0 as n -> + oo), 
then we easily deduce from Theorems 3.1 and 8.4, Lemma 8.5, and the 
properties of (8.50) that the following is true (see Prugovecki and Tip 
[1975]). 


# Sec §3.7 in Chapter V for a definition of spectral integrals. The proof of (8.54) is a 
corollary of Theorem A.2 by Prugovecki [1972b], whereas the proof of the remainder of 
Lemma 8.5 follows from the results in Section 3 of PrugoveCki and Tip [1975]. 
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Theorem 8*11* If H is any self-adjoint operator in ^ and U* is 
the superoperator that acts on all A e as follows, 

(8.55) l J t A = e ^ m Ae-^ H \ t e OP, 


then U* is unitary on ££{£?) and 


(8.56) 

^*1^*2 9 ^1 9 

t 2 e R 1 , 

(8.57) 

U 0 A = A = h-lim U t A, 

0 t-* o * 1 

AeS?(jr), 

(8.58) 

iHA = h-lim(U^ - 

A)/t. 


We recall how Theorem 3.1 was used in clarifying the meaning of 
the Schroedinger equation (3.1) and we observe that the same arguments 
lead to the conclusion that 


u(*, g = u ,_* 0 = iw ; 1 


leaves invariant, and that 


(8.59) 


dA(t) 

dt 


= h-lim 

A t -> 0 


A(t + At) — A(t) 
At 


H A(t) 


whenever A(t 0 ) e & h and A(t) evolves in accordance with (8.39): 

(8.60) A(t) = U (t, t 0 ) A(t 0 ) = U(t, g A(t 0 ) U*(t, t 0 ). 

When we now compare (8.44) with (8.53) and (8.59) we immediately 
uncover the mathematical meaning as well as the physical significance of 
von Neumann’s equation: (8.44) is in fact a Schroedinger equation on 
Liouville space, which consequently governs not the time evolution of 
state vectors (i.e., merely of pure states), but that of density operators 
(i.e., of either pure or mixed states)! 


8.8. Density Matrices on Spectral Representation Spaces 

In quantum mechanics actual computations of quantities describing 
the behavior of a given system are performed in some spectral represen¬ 
tation space L 2 (R n , fi) of a complete set of observables {A x ,..., A n j for 
that system. Such spaces were introduced in the abstract in Definition 
5.2, and concrete realizations of particular importance were encountered 
in §7 in the form of configuration, momentum, and angular momentum 
representation spaces. 
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The same observation stays true in quantum statistical mechanics 
by virtue of the identification of the Liouville space JS?/^) over 
= L 2 (R n , fi) with the Hilbert space L 2 (U 2n , p X fi). 


Theorem 8*12 ♦ To each Hilbert-Schmidt operator A in the 
Liouville space ~S?(J f) over = L 2 (R n , (jl) there corresponds a unique 
h A gL 2 ( M 2n , fi X fi) such that 

(8.61) </ \Ag}= ( /*(*') h A [x\ x")g(x") d^x') d^x") 

for all/, gsL 2 {R n y p), and the mapping Av->h A is a unitary trans¬ 
formation of JSf(L 2 (IR n , fi )) onto L 2 (U 2n y fi x (jl ). 

Proof. If e k (x) y e k '(x) GL( 2 )(U n , fju) are functions representing the 
elements e k , e k eL 2 (IR n , p) in the canonical representation (8.3) of 
A g ^(Jf"), then we observe that by (8.14) 

m n 

II A m - A n li; = £ A»», ,4, = II e k ’ >A fc < |, 

and if we set 

h A y, x") = Z e k '(x') A fc e fc *(x"), 

then 

</1 = f /*(*') *") *(*") <W*') 

J R 2n 

(8.62) 

II ^ - A J| 2 2 = f | h Am (x\ x") - h An (x\ x ")\ 2 d^x’) <W*T). 

J B5 2n 

Consequently, by the Riesz-Fischer theorem (Theorem 4.4 in 
Chapter II), 

(8.63) h A (x\ x”) = l.i.m. h A (x' y x") 

n->co n 

exists and belongs to L( 2 )(R 2n , n X fi ). On the other hand, by (8.3) 
and (8.4), 


</1 Ag} = lim</1 A n g\ /, g /x), 

and since f{x') g*(x") eL( 2 )(U 2n , jjl X ju,), we deduce from (8.62) that 
(8.61) is satisfied. 
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The function h A (x\ x") is not uniquely determined by A since, in 
general, the canonical decomposition (8.3) is not unique [e.g., that is 
the case if some X k is a degenerate eigenvalue of some A = A* g 
However, if h A {x\ x") also satisfies (8.61), then we have 

f mx')g*{x'')rih A {x\ *") - h A '(x\ x'')] d^x') d^x") = 0 

for all /, g g — L 2 (U n , fi ), and since by Theorem 6.9 in Chapter II 
the linear manifold spanned by all functions f(x f ) g*(x") is dense in 
L 2 (R 2n , p X fj), we conclude that h A (x\ x") and h A (x\ x") have to be 
almost everywhere equal with respect to /jl X and therefore they 
represent one and the same element of L 2 (R 2n , fi X fi ). 

Thus A i-* h A is defined for all A g ^(Jf"), and it obviously represents 
a linear mapping. To prove that this mapping is an isometry we note 
that by (8.14) 


Tr (A*A) = lim £ A** = lim Tr (A n *A n ) 

k=l 

= lim (* | h A (x' y x ”)\ 2 dfjb(x f ) dfjL(x")> 

n ^°° J u 2n 


and consequently, by (8.17), 

(8.64) \\A || 2 = f f | h A (x\ Ol 2 d^x’) d^x")] 1 ' 2 = || h A || L 2 . 

The unitarity of this same mapping is finally established by noting that 
to any element h{x f , x") of L( 2 )(R 2n , p X /jl) there corresponds a unique 
operator A such that when h A is replaced by h in (8.61) the outcome is an 
equality valid for all/, g eL 2 (IR n , /jl) (see Exercise 5.3 in Chapter V). 

Q.E.D. 

If p g j£?(L 2 (IR 2n )) is a density operator, then (8.3) assumes the form 
(8.42), and we shall write 

(8.65) <*' | p | *"> = h p (x'> x") = l.i.m. f W k (x') w k V k *(x"). 

Tc =1 

Following a long-established tradition in physics literature, we shall call 
the function (x f \ p | x ff y a density matrix on the spectral representation 
space L 2 (R n , ju,)—albeit this function can be identified with a matrix 
only in the case where the spectra of A x ,..., A n in Definition 5.2 are all 
point spectra so that the support S of the measure /x is a discrete set. 
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This last case, however, almost never occurs in practice. For example, 
in the important case of the configuration to the representation (6.8) 
of the canonical commutation relations (concretely, if in the consider¬ 
ations of §7.2 we have N = nj 3 spinless particles), the Hilbert space 
equals L 2 (R n ), and therefore (x! \ p \ x "> is an element of L( 2 )(U 2n ). 

To understand the physical significance of the density matrix, we 
note that the probability appearing in (8.40) can be expressed as follows: 

(8.66) . \B) = lim £ w k £ | 'P k (x)\* d,,{x). 

Indeed, choosing an orthonormal basis e 1 , e 2 ,... in the subspace of 
L 2 (U n y p) onto which projects, and then completing it to 

an orthonormal basis in L 2 (U n , p) by adding to it e^ y e 2y ... with 
supp e^x) C B f = U n — B y we obtain 

(8.67) Tr \pE^ \B)] = £ <«, | P e i} 

i 

= t ^Y J <e,,\V k )(P k \e i ) 

7c=l i 

through the use of theorems on the unconditional convergence of double 
series with positive sums [already used in (8.11)]. The right-hand sides 
of (8.66) and (8.67), however, coincide. 

We now set by definition 

(8.68) <* I P I *> = £ I Wl ! 

1 

at all x gU u where the above series converges, and by Lebesgue’s 
monotone convergence theorem we obtain in accordance with (8.66) 
the basic result 

(8.69) P? . \B) = Tx\pE Al . \B)] = £ <* | P | *> M*). 

which represents the most general integral-form expression for the 
probability that a simultaneous determinative measurement of the com¬ 
plete set {A 1 ,..., A n j of observables of a system in a (pure or mixed) 
quantum state p would yield values in the Borel set Bs3$ n . For the 
special case B — U n (8.69) assumes the form of a trace formula: 

Tr p = I (x | p | X s ) dp(x). 

J M n 


(8.70) 
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It should be emphasized that in general the defining formula (8.68) 
of the so-called diagonal elements (x\ p \ x) of the density matrix 
| p | x"y is not derivable merely by setting x f = x" — x in (8.65). 
Indeed, (8.65) involves a limit-in-the-mean, which in general implies 
pointwise convergence only almost everywhere on R 2n for some suitable 
subsequence of the partial sums of the given infinite series (see 
Chapter II, Theorem 4.5), whereas the set {(#', x") \ x — x "} might 
be of ^--measure zero in U 2n (e.g., that is the case when p is a Lebesgue- 
Stieltjes measure). On the other hand, all the terms of the series in 
(8.68) are nonnegative and 

f E I w id x )\ 2 M x ) < E w * < 1 

so that (8.69) is true by Theorem 3.10 in Chapter II (cf. also Fatou’s 
lemma). Furthermore, if the equivalence class of all functions h p (x\ x") e 
L( 2 )(IR 2n ) which satisfy (8.65) contains a continuous function, then we 
can set by definition 

<*' I P I *"> = E W) 

l 

for appropriate choices of l P k (x) y k = 1, 2,... . Indeed, in that case by 
Mercer’s theorem (see Riesz and Sz. Nagy [1955], Section 98), the func¬ 
tions W^x) eL( 2 )(IR n ), k — 1, 2,..., appearing in (8.65) can be chosen to 
be continuous, and then the above series converges pointwise (and, 
in fact, also uniformly on compact sets). Hence, in the practically 
important case of continuous density matrices | p \ x ") eL( 2 )(IR 2n ), 
we can obtain the “diagonal” values defined in (8.68) simply by setting 
x f = x" = x in the chosen continuous density matrix (x' \ p \ x"}. 

The most frequently used representations of density matrices are 
those in configuration and in momentum space. For example, for a 
system of N spinless particles possessing a total of n = 3N degrees of 
freedom, the frequent choice of spectral representation space is L 2 (R n ), 
and therefore the Liouville space can be identified with L 2 (R 2n ), where 
the measure p is in this case the Lebesgue measure on the measurable 
space (R n , 3$ n ). 

**8.9. Appendix: Classical and Quantum Statistical Mechanics 
in Master Liouville Space 

Classical and quantum statistical mechanics are both statistical theories 
describing the same body of phenomena, and it is commonly held 
that when both classical and quantum theory are applied to one and the 
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same system (e.g., a Brownian particle moving in some gas, such as air) 
then the classical description should provide in some sense an approxi¬ 
mation to the quantum one. Yet, the conventional mathematical for¬ 
malisms for these two cases are so very dissimilar that it is not at all 
evident that this observation is of universal validity. Thus, whereas, 
as we have seen in §§8.6-8.8, the states of an ensemble are represented 
in the quantum case by density operators p in a Liouville space, their 
evolution in time (in the Schroedinger picture) being governed by the 
von Neumann equation (8.44), in the classical case the states of the same 
ensemble are represented by distribution functions f(q, p\ t) on phase 
space r (where r = U 2n in case of n degrees of freedom) which obey 
the Liouville equation 

(8.7D ^r = i(T^--T4 L 

dt £i \ dqi dpi dpi dq { 

at almost all (q, p) e where H cl is the classical Hamiltonian; e.g., 
for time-independent potentials V(q) and N particles moving in three 
dimensions 

(8.72) rn iq,p;t) = ££+V(q), 

where n = 3N> q = (q 1 ,..., q n ) 9 p = (p 1 and m zj+1 = m 3j+2 = 

msj+z > j = 0,..*, N — 1. 

In this appendix we shall indicate how one can take advantage of the 
very general approach to quantum mechanics developed in this chapter 
to treat classical and quantum statistical mechanics in a unified manner 
by working in both instances on the same Hilbert space. In the afore¬ 
mentioned case of n degrees of freedom this Hilbert space will be L 2 {T). 
We shall refer to the Liouville space over L 2 (T) as the master Liouville 
space for both theories.* 

We represent in ££ r the classical state of an ensemble by the equivalence 
class of all density operators p cl e for which 

n 

(8.73) f(q, P) = <(?,/) I pP 1 | q,py = Urn £ w k \V k {q, p) |i, 

1 

* It is also possible to recast quantum statistical mechanics in terms of a formalism 
bused on phase space distribution functions. For these distribution functions one can 
adopt the Wigner transforms of density matrices (see, e.g., Balescu [1975]). However, 
this transform is not positive definite, so it cannot be interpreted as a probability density 
on phase space (see, e.g., Srinivas and Wolf [1975]). The alternative is to use the stochastic 
plume space probability densities briefly described later in this section (see the review 
articles by Prugovegki [1979, 1981 a, b] and Ali and PrugoveCki [1980] for details 

anti further references). 
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almost everywhere in L 2 (.T), where / {q y p) is the r distribution function 
describing that state. Introducing in 3? r the superoperator 

(8.74) 


H c O 


n 





■Pic 


a \ ( 8V(g f ) 

Sq" k ' \ 8q k ’ 


8 

dpk 


Mill 


it is easily verified* that if we restrict ourselves to diagonal elements, 
by setting q’ = q", p' = p" in the (classical) von Neumann equation 


(8.75) 


i(q',p' 


cl 


dt 


q\p‘ 




H C V( X I q",P"> 


for any density matrix representing an evolving state in accordance 
with (8.73), then (8.75) coincides with the Liouville equation (8.71). 

To formulate quantum statistical mechanics on 3 ? r , we adopt in 
L\r) the following representation of the canonical commutation relations 

(8.76) Q k □ ih , P k Dp k ~ ih , k = 1,..., w. 

“Pic cq k 

Of course, this representation is highly reducible and according to 
Theorem 6.4 it can be decomposed into a (countable) direct sum of 
irreducible representations. Theorem 6.4, however, does not claim that 
the decomposition is unique, and indeed an uncountable infinity of 
such decompositions exists. 

To uncover those closed subspaces of L?{T) carrying irreducible repre¬ 
sentations of physical interest, we consider positive operator-valued 
measures of the form 


(8.77) P V (B) = f | y^Xy*.* I d n q d n P> B e <g*« 9 

J B 

where for each fixed (i q , p) e r the function 

(8.78) „(?', p') = [ exp [{ijh){p - p') ■ x] y *(x - q') y(x - q) d n x 

J u n 


* Further details and proofs of all the statements in this section can be found in AH and 
PrugoveSki [1977a, b] and Prugovefiki [1978a-d, 1981b]. 
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represents an element of L 2 (P) if yeL 2 (R n ). It turns out that IP V (P) 
is the projector onto a closed subspace of L 2 (P) when {2nK) n ! 2 y{x) 
is normalized, and that this subspace L 2 (P V ) is left invariant by 
the operators (8.76) that induce in L 2 {T y ) an irreducible representation 
of the canonical commutation relations. The last fact is established by 
proving that the mapping 

(8.79) p(x) i-* p(q,p) = f exp[— (i/fi) p * x ] y*(x — q) p(x) d n x 

\ n 

supplies a unitary transformation U y of L 2 (R n ) onto L 2 (P V ), and that the 
unitary transforms UyQjJJy 1 and of the Schroedinger repre¬ 

sentation (6.8) of the canonical commutation relation for n degrees of 
freedom coincide with the restrictions of the operators Q k and P k in 
(8.76) to the closed subspace L 2 (P V ) of L 2 (P) [note that L 2 (P V ) is itself 
a Hilbert space!]. In general, distinct choices of y eL 2 (IR n ) will lead to 
distinct spaces L 2 (P V ) (that have only the zero vector in common) as 
illustrated by the choice 

(8.80) y (s \x) = (ti#V)-*/‘ exp(—* 2 /2fo 2 ), s > 0, 

providing a continuum of different subspaces L 2 (P v(s ,) as the parameter 
s varies over (0, oo). 

The fact that the mapping U y defined in (8.79) is unitary for each 
y sL 2 (U n ) indicates that one and the same quantum state of an ensemble 
will be represented in by an equivalence class of density operators, 
albeit the equivalence relation has to be defined in a way that is different 
from the method used in the classical case, namely to each p s JSf(L 2 (P)) 
= Se r , given in the configuration representation by the density matrix 
(jx | p | x*y, we assign a unique p y — U^U" 1 in each of the Liouville 
spaces Se r over L 2 {T y ). The essential point, however, is that a (quantum) 
von Neumann equation 

(8.81) Kq\p' I dpjdt I q\p' f > = (q\p f \ H Pt | q\ 

representing a counterpart of (8.75) can be introduced globally on L 2 (P) 
by setting 

(8.82) ft = h-'[H 0 + fly, •], #0 = 1 P?\*m, til = V(Q), 

2=1 

and that due to the unitarity of U y the restriction of this global equation 
to each -5^ supplies a time evolution for each 

(8.83) p t v = exp[i(ft„ + fti)<]p v . t e IR 1 , 
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that is unitarily equivalent to that supplied by the conventional von 
Neumann equation in the configuration representation (naturally, the 
self-adjointness of Q k and P k guarantees that of i? 0 and H Y defined as 
operator-valued functions in accordance with Theorem 2.5, and the 
question of the self-adjointness of H 0 -j- H l can be handled with the 
methods of §7); furthermore, when the restriction to diagonal elements 
is executed in (8.75) and (8.81), a detailed comparison shows that in 
the zeroth order in ft the two equations coincide (cf. Prugovecki [1978b]). 

This last observation supplies the first indication as to the exact 
sense in which the classical theory approximates the quantum theory 
when in the later case terms of first and higher orders in ft are neglected. 
However, the physical validity of this conclusion depends on the existence 
of a physical interpretation for the diagonal elements of density matrices 
in 5£ r . In the classical case such an interpretation has been implicitly 
imposed from the very beginning by the relation (8.73) between F 
distribution functions and density matrices on S £ r . 

To discover the physical meaning of (q, p | p y | q, p} in the quantum 
case, we use the mapping 

(8.84) p^p v = U yP U~\ P g ^(L*(R»)), P y g J^ ry , 


to establish that as a consequence of the definition (8.79) of U y , 

(8.85) f <q,p\ P v \q,p>d n p 

J u n 

= f Xq v ( x K x \p\ x > d n x y Xq v ( x ) = (27ih) n I y(x — q) | 2 , 

J [IS n 

(8.86) f (q,p \p y \ q,pyd n q 

J u n 

= f */(*)<* I p I *> d-K xAk) = (2**r I y(k - p )|», 

J u n 

where (k | p | k} denotes the density matrix in the momentum repre¬ 
sentation, i.e. (cf. §8.8), 

OI p I *> = I I M*)l 2 » I p I ky = £ Wi | $ } {k)\\ 

3=1 3=1 

The physical meaning of the integrals in (8.85) and (8.86) emerges as 
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sooti as the resulting measures 


(8.87) P® 1 . °"(B) 

P y 

= Tr|> v P v (P X R”)] = f d n q f Xi v ( x K x I P I *> d n x, 

JB •'pn 


(8.88) P p \ . P \B) 

P y 

= Tr[p y P v (U n X B)] = f d n p f x v ( k K k \p\ k > dnk > 

J B •'p» 


on (R n , & n ) are compared with their conventional counterparts obtained 
by specializing (8.69) to the configuration and momentum represen¬ 
tations, respectively: 


(8.89) P°' ° n (B) = Tr [pE 0 * ° n (B)\ = \ (x | p \ x) d n x, , 

(8.90) P Pl Pn (B) = Tr[ P E Pl . Pn (B)] = f (k \ $ \ Uy d n k. 

p Jb 

Indeed, if we consider (8.78) and (8.88) for the case of y given in (8.80), 
we easily establish that 

(8.91) P* 0n (B) = lim P^(t) * ’ 0n (B ), 

(8.92) P Pl Pn (B) = s lim P p ^ s r Pn (B). 


For $e(0, + oo), or for the case of a general y, (8.87) and (8.88) 
also have an obvious interpretation: whereas (8.89) and (8.90) are the 
probabilities that would be measured if in each random sample of values 
of position and momentum, respectively, each value x e and k e (R n 
would be determined with absolute precision, (8.87) and (8.88) are 
probabilities obtainable with realistic measurements performed with 
actual instruments whose readings necessarily involve a margin of 
uncertainty (cf. Dietrich [1973]). In other words, by executing the 
calibration of such instruments we always find out that, say, a reading 
q in reality means that for any interval I containing q the confidence 
probability of the actual value x being within I equals ^, v (7), an< ^ th at 
measure ^%B), is never a S-measure centered at x. Thus, in 

our case the respective confidence measures would be 


(8.93) /*/(#) = f MB) = f *„*(*) **, 

j b j b 


and we can regard (8.78) and (8.88) as the probabilities of obtaining 
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stochastic values ( q , puf) and ( p , /2 p v ), respectively, with q e B and p e B. 
Naturally, under normal circumstances the difference between (8.87) 
and (8.89), or (8.88) and (8.90), might be negligible by comparison with 
the statistical fluctuation displayed by different samples. 

Yet, there is one obvious advantage of dealing with stochastic values 
in quanturti mechanics: if we do that we can then consider the possi¬ 
bility of simultaneous stochastic values (( q , puf), ( p , /2 p v )) for position 
and momentum without violating the uncertainty principle, since in 
view of the definitions in (8.85) and (8.86) of the confidence functions 
Xq{ x ) and Xp v (k)> the spreads of these functions automatically obey 
that principle. Denoting by r v the resulting space* of all such stochastic 
values obtained as (q, p) varies over r with y fixed, we can consistently 
extend the standard interpretation of quantum mechanics based on 
Definition 1.3 by interpreting 

(8.94) P Ql °»’ Pl Pn (B) - Tr [p v P v (B)], B e 

p v 

as the probability that a determinative measurement of stochastic 
values in r v on an ensemble in state p would yield outcomes ((<"/, p- q y ), 
( p , (k p v )) with ( q, p) s B. Indeed, by (8.87) and (8.88), 


(8.95) P°J . °"- Pl . P "(B x R w ) = P°J . 

(8.96) P Ql .°"- Pl . p ”(U n x B) = P Pl P "(B), 

P V P y 


so that this interpretation is simply an extrapolation of the conventional 
one. 

One of the many attractive features of such an interpretation is that 
it naturally leads to a concept of probability current in quantum 
mechanics without any need for extraneous new postulates. Consider 
for the sake of simplicity the case of a single particle, i.e., of three 
degrees of freedom and of r = R 6 . Setting in the classical case 

(8.97) pf( r) = f <r,p | pf \ r, p> dp = f / f (r, p) dp 

V *V 

(8.98) jf(r) = f v /f(r, p) dp, v = p/m, 

''os® 

we see that as the average value of velocity, j^(r) represents & probability 
current at r e R 3 . Specializing (8.71) and (8.72) to n = 3 and integrating 


This is called stochastic phase space ; cf. PrugoveSki [1981a], 
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both sides of the Liouville equation (8.71) in p over R 3 , we obtain the 
well-known continuity equation 

(8.99) 8 tP f(r) + V-,f(r) =0 

expressing probability conservation. 

Let us similarly define in the quantum case 

(8.100) pl s) ( r) = f <r,p | pf* \ r, p> dp, 

V 

(8.101) jj s) (r) = J (p/m)<r, p | pf \ r, p> dp. 

We can then easily establish that (cf. Prugovecki [1978a]) 

(8.102) 8 tP \\ r) +V-j< s) (r)=0, 

so that jj s) (r) possesses all the natural features of a probability current. 
Moreover, 

(8.103) Hmp (s) (r) = f jW]c \ 0 fc (r)| 2 = <r | p | r> 

(8104) Jim j<«>(r) = ~ f ®J0**(r) V«r) - «r) V0 s *(r)] 

whenever V?/f fc (r), k = 1, 2,... are continuous in a neighborhood of r 
and the two series can be suitably majorized. The expression on the 
right-hand side of (8.104) coincides with the probability current found 
in all text-books on quantum mechanics (see, e.g., Messiah [1962]). 


Exercises 

8*1. Show that the projector Z? M onto an infinite-dimensional closed 
subspace M of is not completely continuous. 

8.2* Prove that if the continuous spectrum of a self-adjoint 
operator A is not empty and contains points which are not accumulation 
points of the point spectrum, then contains infinitely many points. 

8*3* Show that a bounded self-adjoint operator with a pure point 
spectrum, containing only eigenvalues of finite degeneracy which have no 
accumulation points with the possible exception of zero, is a completely 
continuous operator. 
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8*4* Show that the trace of a projector is equal to the dimension of 
the space onto which it projects. 

8*5* Show that if a self-adjoint operator A is positive definite, i.e., 
</l 4f> >0ior all fs SJ A , then its spectrum does not contain negative 
values. 

8*6* Prove that if Tr A is finite and A = A*, then the traces of the 
operators A (+) and A ( ~ } defined in (8.24) are also finite. 

8*7* Show that if the linear operators A , B y C are the uniform limits 
of the sequences {A n }, {B n }, {C n }, then ABC is the uniform limit of the 
sequence {A n B n C n }. 


References for Further Study 

The theory of quantum measurement has many ramifications that traditionally are 
surveyed primarily in monographs on the philosophy of quantum mechanics, such as 
d’Espagnat [1976] or Jammer [1974], or in specialized collections of review articles, 
such as those by Price and Chissick [1977] or Marlow [1978]; for basic aspects see, 
however, Messiah [1962], Gottfried [1966], and Ballentine [1970]. 

A great deal of functional analysis pertinent to the mathematical background of §§2-7 
can be found in Riesz and Sz. Nagy [1955], Akhiezer and Glazman [1961], Stone [1964], 
Schechter [1971], Reed and Simon [1972, 1975], and Yosida [1974]. 

The operator-theoretical treatment of complete sets of observables of §5 is taken from 
Prugovecki [1969a], whereas other rigorous treatments are based on a PF*-algebra 
approach (Jauch and Misra [1975]). The case of infinitely many observables has been 
given a careful mathematical treatment by de Dormal and Gautrin [1975], and relevant 
results are also reported by Berezanskii [1978] and Ascoli et al. [1978]. 

The subject of canonical commutation relations for a finite number of degrees of 
freedom treated in §6 is thoroughly surveyed by Putnam [1967], who also provides an 
extensive bibliography on the subject. An introduction to the case of infinitely many 
degrees of freedom that contains further references can be found in Bogolubov et al. 
[1975]. 

Many of the results on essential self-adjointness of Schroedinger operators that were 
only briefly dealt with in §7 are reviewed by Kato [1967] and by Reed and Simon [1975]. 
The subject of group theory in quantum mechanics (barely touched upon in §7.8 since 
it lies beyond the scope of the present book) has been given careful mathematical treatment 
in many monographs, such as those by Miller [1972] and by Barut and R^czka [1977]. 

The concept of density operator was first introduced by von Neumann [1955] in the 
original German edition of his classic book, and by now it forms the basis of all standard 
formulations of the concept of state in quantum statistical mechanics (see, e.g., Balescu 
[1975]). However, other formulations exist, such as those based on the C*-algebra 
approach (see Emch [1972]). Many questions pertinent to the related subject of com¬ 
pletely continuous, Hilbert-Schmidt and trace-class operators are systematically treated 
by Schatten [I960]. 
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It should be mentioned that there have been many attempts to generalize the for¬ 
malism of quantum mechanics by employing mathematical tools other than Hilbert space 
theory. Perhaps the best known is that employing lattice theory, and a very readable 
account can be found in Jauch [1968], whereas Varadarajan [1968] covers exhaustively 
the more technical aspects of this subject. In addition, other mathematical disciplines, 
ranging from W*- and C*-algebras (von Neumann [1936], Segal [1947]) to measure 
theory (Prugovecki [1966]) have been used. Yet, Hilbert space theory has remained the 
one and only practically viable alternative that enjoys universal acceptance in its role of 
providing the mathematical background for nonrelativistic quantum mechanics. 



CHAPTER V 


Quantum, Mechanical Scattering Theory 


In the preceding four chapters we have given the basic mathematical 
tools needed for a mathematically rigorous study of conventional non- 
relativistic quantum mechanics. The actual applications have been 
limited either to obtaining very general, albeit fundamental, results of 
quantum mechanics or to computing the bound states of specific systems 
(e.g., the hydrogen atom). 

Within the context of nonrelativistic quantum mechanics, scattering 
theory deals with collision phenomena of particles moving at velocities 
small by comparison with the speed of light. Since scattering theory is 
concerned with completely or partially free particles, it is of utmost 
interest from the experimental point of view. From the theoretical point 
of view, scattering theory is still the object of intensive research for the 
physicist as well as the mathematician. 

We shall give in this chapter the most basic ideas of nonrelativistic 
quantum scattering theory. In illustrating these ideas, we shall limit 
ourselves to two-particle scattering, which is the only case which has 
received a thorough treatment until now. The scattering of n particles, 
usually referred to as n-body or multichannel scattering theory, is a 
field of active current research, which deserves at least one entire 
volume for a minimally adequate treatment. A very short introduction 
to this general case will be given in §8 of the present chapter. 

1 ♦ Basic Concepts in Scattering Theory 
of Two Particles 

1.1. Scattering Theory and the Initial-Value Problem 

In studying in Chapter II n-particle systems in quantum mechanics, 
we were concerned exclusively with problems involving bound states. 
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Such states are related to the point spectrum of the internal energy 
operator of the system, as described in §5, Chapter II. We shall now 
turn our attention to two-particle states which are not bound states. 

The solution of the general initial-value problem for an w-particle 
system is in essence contained in Axiom S3 of Chapter IV, §3—namely, 
if we could prepare by some means a state of the system at the instant 
t 0 , and if the Hamiltonian H were given, then, in principle, we could 
compute the state* 


W(t) = exp [~iH(t - * 0 )] W 0 


at all later instants t. However, such a computational job, besides being 
in most practical cases unmanageable due to its extremely great com¬ 
putational complexity, is not warranted by the needs of the experiment¬ 
alist who carries out experiments at the present level of technical sophis¬ 
tication. 

The experiments carried out at present in molecular, atomic, nuclear, 
and elementary particle physics which involve, from the theoretical 
point of view, an initial value problem are exclusively scattering experi¬ 
ments. Such experiments are characterized by the fact that the system 
undergoes a preparatory measurement at some instant t Q , a deter¬ 
minative measurement at a later instant t x , and that for most of the time 
span t x — t 0 of the experiment there is no interaction (or practically 
negligeable interaction) between the different constituent parts of the 
system. As a matter of fact, these parts interact with one another only 
for a very small part of the duration t± — t 0 of the experiment. For the 
rest of the time they are spatially too far apart to have the very short- 
range forces make themselves felt. This and some other features of 
scattering experiments, which will be mentioned later, make it 
unnecessary to solve completely the initial-value problem in order to 
relate the theoretical predictions to the available experimental data. 

It is clear from the above remarks that for the greater part of a 
scattering experiment, the different constituent parts of the physical 
system under observation move independently from one another, i.e., 
they are “almost free.” In order to elucidate this intuitive concept of 
“almost free” motion in a mathematically clear and concise form, it is 
desirable to consider this concept first in the framework of classical 
mechanics, which readily appeals to intuition. 

# In this chapter we adopt a system of units in which h = 1. 
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1.2. Asymptotic States in Classical Mechanics 

Let us consider a two-particle classical system. In the Newtonian 
formalism of classical mechanics, the state of such a system is given by 
a vector-valued function 


(1.1) x(t) = (*!(*), y^t), z^t), x 2 (t\ y 2 (t), z 2 (t)) e U G . 


The trajectory of the kth particle, k = 1,2, can be described by a three- 
dimensional vector-valued function r k (t), which geometrically is a curve 
in three dimensions. If the two particles of the system undergo the kind 
of motion which was depicted earlier as typical in a scattering experiment, 
then the trajectories r k (t)> k = 1,2, should have as characteristic the 
main qualitative features of the curves in Fig. 4: at the beginning t & t Q 



Fig. 4. Asymptotic states in classical mechanics. 

of the experiment and for a relatively long time afterwards the motion of 
each particle should be “almost free,” i.e., “almost” along a straight 
line rjf(£) at a uniform speed vjf, k = 1, 2; then, during a relatively short 
period, while the particles are interacting with one another, the motion is, 
in general, very complex; finally, at the end, t & t x , of the experiment 
and for some time before that, the motion of the particles should be 
again along straight lines r£ ut (£) and at “uniform speed” v^ ut . 

Now, in classical mechanics a particle is said to be in free motion if its 
state is of the form 

r(0 = r 0 + vt. 


so that r(£) = 0. Hence, at times t & t 0 , we expect that 
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and similarly for t t x 


out . out. 

^ r fc + v fc U 


k = 1,2. 


The vectors rj. n and vj. n characterize a free state of the kth. particle, 
k= 1,2, which is called the incoming asymptotic state . Similarly, the free 
state characterized by r£ ut and v£ ut is called the outgoing asymptotic state 
of the &th particle. 

It is not a priori clear in what precise sense the approximation signs 
in (1.2) and (1.3) should be understood. One obvious choice is to take 
them to mean that 

I r k(t) — (i{ n + v*t)\ »0, t < 0 > 


where | • | above denotes the length of three-dimensional vectors. 
Since one can reasonably expect that the situation does not intrinsically 
change if we increase — t Q , and, moreover, the above approximations 
might hold with increasing accuracy when t 0 — > — oo, it seems natural 
to give to (1.4) the following precise meaning: 

Um I r fc(0 ~ ( r * n + vl "0l = °» 


lira | r k (t) — v “ | = 0. 


Similar considerations lead to the following corresponding require¬ 
ments: 

Hm | r k (t) - (r“ ut + v° ut <)l = °> 

/jm I r*(0 - v° ut | = 0. 


The conditions (1.5) and (1.6) define in a precise way the asymptotic 
states of a single particle. Naturally, the asymptotic states # in (£) and 
x °ut(j) 0 f tho entire system is the aggregate of the asymptotic states of 
its two constituent particles. 

Incoming asymptotic states defined by (1.5) can be thought of as 
being states in which the particles of the system would be at all times if 
there were no interaction between the two particles. Similarly, the out¬ 
going states would correspond to the fictitious case of particles which 
have interacted in the “infinite past,” and afterwards move inde¬ 
pendently. 
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We note that according to the definitions in (1.5) and (1.6), if asymp¬ 
totic states exist at all for a given classical physical process, then they 
are unique for each state, since v^ n and v£ ut are uniquely determined by 
the second of the respective relations in (1.5) and in (1.6), and then rjf 
and r£ ut are uniquely determined by the first relations in (1.5) and (1.6), 
respectively. It is, however, natural to ask whether asymptotic states in 
the above sense exist for every classical physical process of importance. 

The answer to the above question is negative. In the case of long- 
range forces, i.e., forces which are of significant intensity even at 
arbitrarily large separations between particles, asymptotic states in the 
above sense do not exist. A notable example of such a force is the 
Coulomb force (see Exercises 1.1 and 1.2). The intuitive physical 
explanation of this phenomenon is that long-range forces never suffi¬ 
ciently loosen their grip on the particles to allow them to travel in a free 
manner when they are at large distances from one another. A generali¬ 
zation of the above concept of asymptotic state is obviously necessary to 
deal with this situation. However, the already introduced concept is of 
sufficient generality to cope with most of the practically important cases. 
Even the Coulomb potential is in practice most often “screened,” i.e., 
instead of having to deal with the Coulomb potential V(r) ~ 1/r, we are 
faced with a potential which at large distances decreases much faster 
than 1/r, and for which asymptotic states, in the above sense, exist. 

1.3. Asymptotic States and Scattering States 

IN THE SCHROEDINGER PICTURE 

It is straightforward to adapt the concept of asymptotic states defined 
in the classical two-particle case by (1.5) and (1.6) to the case of quantum 
mechanics. We can achieve this by stating that in the Schroedinger 
picture, the “free states” W in (t) and ^^(t) are, respectively, the 
strong (or Moller) asymptotic incoming and outgoing states of a two- 
particle system in state ¥*(£) if and only if* 

(1.7) lim || W(t) - *f' ln (<)|| = lim || W(t) - W° u \t) || = 0. 

—00 t-*- f-00 

In order to define precisely the meaning of the concept of a “free 
state” we must assume that in addition to the “total” Hamiltonian H {2) 
of the two-particle quantum mechanical system we have at our disposal 
a “free” Hamiltonian which would describe a system consisting of 
two particles with the same physical characteristics (mass, intrinsic 


* See §2.7 for a more general concept of quantum asymptotic state. 
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spin, etc.), but which do not interact among themselves. For instance, 
in the case where H {2) is given essentially* by 


< u > - 2sr 4 '-isr A >+’ , W' ' = '«-*■ 

then H ( q ] is taken to be essentially 


(1.9) 



1 

2 m 2 


A 


2 > 


where the domains of definition of the above differential operators are 
adequately defined/ as was done in §7 of Chapter IV. However, if both 
particles move in an external force field, given by a potential 
F ex t(f*i > * 2 )* then instead of (1.8) we have 

- 2^ Al “ 2 ^ + F(r) + Fext(ri ’ r *>’ 
and instead of (1.9) we have 

- 2 ^ Al ~ 2^7 Aa + Fext(ri ’ ra 

This will be the case when, for example, the system under consideration 
consists of two spinless charged particles moving in the force field 
generated by a much heavier particle (such as the heavy nucleus of some 
atom). Since the heavier particle is practically unaffected by the motion 
of the lighter particles, such a problem can be treated fairly accurately 
as a two-body problem in an external field rather than a three-body 
problem. 

Consider now the general case when the “free” Hamiltonian H {2) 
and the total Hamiltonian H {2) are given by the self-adjoint operators 
acting in the Hilbert space Jf {2) associated with the system. We say 
that a nonzero vector-valued function t e R 1 , is a free state in the 

Schroedinger picture if and only if it satisfies the relation 

W if \t) = exp(-iH^t) W {f \0), t e U\ 

and if ¥ /( ^ , (0) is orthogonal to the closed linear subspace Jfob bound 
states in Jf? {2) of the system with Hamiltonian H { 0 2) . In the absence of 


# See the discussion in §7 of Chapter IV on the essential self-adjointness of the 
Schroedinger operator. 

• We note that in the preceding two operators, as well as throughout this chapter, we 
have adopted a system of units in which h = 1. 
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external forces both H {2) and its internal energy part Hffi, obtained 
after subtraction from H^ 2) of the center of mass motion, are operators 
with a pure continuous spectrum so that = {0}. For example, 

this is the case when H {2) is given by (1.9), and therefore is essentially 
—(l/2m 0 )A acting on wave functions */r(R, r), where A is the Laplacian 
containing derivatives with respect to the relative position coordinates 
of r and m 0 = -j- m 2 )~ 1 . 

In reality, the particles interact, and any Schroedinger picture state 
¥*( t ) in which they can be found, which will be called occasionally an 
interacting state , has to satisfy the relation 

W{t) = exp (~iH™t) V(0), t e R 1 . 

Due to the unitarity of the operator exp(— iH {2) t) we have 

(1.10) II m - = II exp(-*77 <2> 0 'F(0) ~ exp W ex (0)\\ 

= || W(0) - exp(iff®*) exp W ex (0)\\ 

= || expexp(-iff®*) «P(0) - ^ ex (0)||, 

where “ex” is a symbol which will be frequently used in the future, 
and which stands for “in” and “out.” 

In view of the preceding notation, we see that (1.7) implies that 

(1.11) lim || F(0) - exp (iH i2) t) exp (-iff®/) ^ ex (0)|| = 0, 

t~* ~F co 

or, equivalently, 

(1.12) || ^(O) - exp(«7/' 2) 0 exp (-iH (2) t) ^(0)11 = 0, 

where the interpretation of the above notation is that one should let 
t -> — oo when “ex” stands for “in,” and t -> +oo when “ex” stands 
for “out.” Since, conversely, (1.11) or (1.12) implies that (1.7) is true, 
we can state the following theorem. 

Theorem l A* The state ¥*(£) = exp(—iH {2) t) *¥(0) possesses strong 
incoming and outgoing asymptotic states W e *(t): 

(a) if and only if the strong limits 

(1.13) W ex = s-lhn exp (iH {2) t) exp (-iH i2) t) W(0) 9 

exist and belong to ^ (2) © 
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(b) if and only if there are vectors ^^(O) e © c^ 2) , ex = m > out > 
such that 

(1.14) ^(0) = s-lim exp (iH i2) t) W ex (0). 

"F oo 

An interacting state ^(t) which has an incoming as well as an outgoing 
asymptotic state is called a scattering state. 

We recall from relation (3.43) of Chapter IV that 

9(t) = exp(iH ( 0 2) t)exp(-M (2) t) W( 0) 

represents in the interaction picture the same state which is represented 
in the Schroedinger picture by W(t) = exp(— iH {2) t) ^(O). Thus, 1 P rex , 
ex = in, out, can be thought of as being the asymptotic states of t ) 
in the interaction picture. 

1.4. Moller Wave Operators 

Denote by R+* and R {2) the set of all vectors which are the limits (1.14) 
for some ^^(O) e «^ (2) © (where ^ob is spanned by the bound 
states of Hq 2) ) when t -> — oo and t -> + oo, respectively, i.e., 

R? = {/+ :/ + = s-lim Q<*(t)f,fejrQ jftV}, 

t ~*—00 

(1.15) R? = {/_ :/_ = s-lim Q i2 \t)f,fe * © •*$}, 

i-»+oo 

.Q (2) (£) = exp (iH^t) exp(— iH^t). 

We see from (1.14) that if a nonzero vector g belongs to either R (2) or 
R!?\ then exp(— iH {2) t)g does not represent a bound state, since it is 
intuitively obvious that a bound state cannot be asymptotically free. 
Hence, it seems natural to assume that in any realistic quantum 
mechanical two-particle theory 

R (2) _ D (2) 3 #?( 2 )± 

+ — b , 

where is the space of bound states of H i2) . We shall see later that 
the conclusion of this heuristic argument is confirmed when dealing 
with potentials which usually occur in practice. 

Lemma If A(t) is a uniformly bounded family of operators, 


II A(t )|| < C, 


— 00 < t < + 00 , 
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then the set M of all vectors / e for which 

s-lim A(t)f, — oo < t 0 < +oo, 

t->t 0 

exists is a closed linear subspace of 

Proof . Since the existence of the strong limits of A(t)f and 
A(t) g for t —> t 0 implies the existence of the strong limits of 
A(t)(af + bg) = aA(t) f -j- bA(t) g, it follows that M is a linear subspace 
of jtr. 

Let /be a vector in 3f which is the strong limit of a sequence f x ,/ 2 
of vectors from M. If 

gn = s “^ m A{t)f n , 

t~*t 0 

then we can write 

(1.16) 11^) - A{t 2 )}f\\ 

= \\[A( tl ) - A(t 2 )](f -f n ) + [A( tl ) - A(t 2 ))f n || 

< ||[^(g - A(t 2 )](f - A)II + II A( tl )f n - gn - [A(t 2 )f n - I-JII 

< \\Ah)(f -fn)\\ + II A(t 2 )(f -A)|| 

+ \\A{ti)f n - g n || + || A(t 2 )f n - g n || 

< 2C ||/ - A || + || A( tl )f n -g n || + || A(t 2 )f n - g n ||. 

By choosing some sufficiently large n = n 0 , we can obtain 

2C||/-A„ll<«/3. 

Since, for this fixed n 0 , we have 

\\A(t)f« 0 -g«J<*l3 

for all t sufficiently close* to t 0 , we see by glancing at the right-hand side 
of (1.16) that 

\\A{t 1 )f-A[t 2 )f\\<, 

for all such t 1 , t 2 which are sufficiently close to t 0 . Hence, the strong 
limit of A(t)f for t —*■ t 0 exists, and therefore fe M. Q.E.D. 

The operator function Q i2) (t) defined in (1.15) assumes as value a 
unitary operator for every value of t e R 1 , and consequently it is uniformly 
bounded: 

||£? ,2) (<)|| = 1, —oo < t < +oo. 


* If £ 0 — db°o, sufficiently “close** to t 0 means sufficiently large. 
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Thus, we can apply Lemma 1.1 to conclude that the families M ( _^ of 
all vectors / e e^ (2) © for which the strong limits Q (2) (t)f for 

t—> —co and t -> +°o, respectively, exist are closed linear subspaces 
of ^ (2) © ^ob* Hence, the family 

m (2> = m ( 2) n m (2) 

of all vectors / for which both limits, 

s-\im& 2 \t)f=f ± . 


exist is a closed linear subspace of ^ (2) . 

Let be the projectors onto 

(1.17) £ M p?f <2> = Mi 2> . 

Then, according to the above definition of M^, the limits 

s-lim Q™{t) E^f 

exist for all vectors /e Jf, because E^f e Then the operators 

(1.18) Qf = s-lim D i2 \t) E m ( 2 ) 

exist and are defined on the entire Hilbert space J^ {2) . The operator Q {2) 
is called the incoming Moller wave operator and Q (2) is called the outgoing 
Moller wave operator . We shall refer to these two operators as the “in” 
and “out” wave operators. 

1.5. The Scattering Operator 

The scattering operator (or S operator) S (2) for the problem at hand is 
defined in terms of the wave operators Q {2) , 


(1.19) S i2) = £i 2) *^ 2) . 

Since the wave operators are strong limits of uniformly bounded 
operators Q m {t), || .f2 (2) (J)|| ^ 1, they are bounded. Furthermore, we 
have seen that.Q^ 2) are defined everywhere in ^ (2) . Hence, the adjoints 
f2 ( j a) * exist and are bounded (see Chapter III, Theorem 2.7). Thus, S {2) 
is a bounded linear operator, defined on the entire Hilbert space 
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The importance of the scattering operator lies in the fact that many 
quantities which are readily measurable by scattering experiments are 
easy to compute if we know S (2) . 

To provide an illustration, assume that a scattering state ¥+(*) of the 
system has been prepared at the beginning of the scattering experiment, 
and that at the instant £ 0 of the preparatory measurement W + (t) had 
been essentially a free state 


lim II W + (t) - ^ n (<)|| = 0, V?(t) = expS'ftO). 

—00 


Furthermore, assume that a determinative measurement at an instant 
t x , long after the two particles of the system have interacted, can 
determine the value of the observable E t of the form 

f i 

E h = | ¥'_(< 1 )><¥ / -(*i)l> 11^)11 = 1, 

which is represented by the projector on the scattering state W_(t) 
at t = t x . Then, if is normalized, the mean value of E ti is given by 
the expression 

(1.20) P 7 ^ 7 _ = <w + ( tl ) I E h = |<^i) I ^-(^)>l 2 . 

We recall from the discussion at the end of §1 in Chapter IV that the 
above expression is usually referred to as the transition probability 
from W+{t) to due to the usual assumption that if the outcome of 

the determinative measurement of E t is A = 1, the system is left after¬ 
wards in the state 

We note that the expression in (1.20) is actually independent of t x , 

(1.21) P 7+ -v_ = l<¥+(0) I ^-(0)>l 2 > 

due to the fact that the operator exp(— iH {2) t) is unitary. Let W^^t) be 
the strong outgoing asymptotic state of 

lim II y_(t) - WT\t)W = 0, WT\t) = exp Wl u \0). 

t~*+co 

Since according to (1.14) 

W + { 0) = s-lim exp(iH i2) t) exp 'F+(0) = Qf^O), 

W_(0) = s-lim exp (iH®t) exp (-iH?t) ¥^"*(0) = Q (2) Wl u \0), 
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we arrive at the following expression for the transition probability: 

(1.22) = |<&i 2 V ut (0) | ^| 2 ¥f(0)>| 2 


= |<^ ut (0) | ^ i 2) *^ 2) ^ n (0)>| 2 


= |<^ ut (0) | s (2 Vf(0)>| 2 . 

This expression is very convenient and helpful in practice, since it 
involves free states which in the typical scattering experiment are com¬ 
putationally easy to derive from the available experimental data. 

1.6. The Differential Scattering Cross Section 

In practice, the typical two-particle scattering experiment is of the 
kind depicted in Figure 5: a beam of particles impinges on a target 



consisting of particles S 2 . Assuming that the particles in the beam do 
not interact with one another, and that each particle in the beam 
interacts with only one particle S 2 m the target, the experiment can be 
viewed as consisting of a large ensemble of independent scattering 
experiments of two-particle systems* S = {®j_, S 2 }. 


# The two mentioned conditions can be satisfied by taking a “weak” beam of particles 
(i.e.p a beam with few particles per unit volume), and taking for the target a slab of material 
which is sufficiently thin to eliminate the possibility of a particle scattering from more 
than one particle in the target. 
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The above experimental setup is ideally suited to determine the 
percentage of the particles in the beam which can be found in a given 
volume of space, occupied by a detector, after they have been scattered 
by the target. In any scattering experiment the detector is placed as 
depicted in Fig. 5, i.e., sufficiently far from the target so that it never 
interferes with the scattering process. Hence, the experiment obviously 
provides information about the probability that a particle of the beam, 
coming towards the target from the direction 

^0 = (<£o > 0<))> 0 < </> 0 < 277, 0 < 0 O < 77, 

will be scattered within the solid angle dco around the direction 

(1.23) a; = (<M), 0 < </> < 2tt, 0 < 6 < 7T. 

If N denotes the number of two-particle scatterings per unit time at 
the energy of relative motion E , then the number of particles scattered 
within the solid angle dco can be written in the form Ntt(E , co 0 , co) dco , 
since it is obviously proportional to N as long as the two earlier men¬ 
tioned conditions on beam and target are satisfied. It can be expected 
that 7 t(E, co 0 , co) is dependent on the energies E and E 9 for the relative 
motion of the particles of the system © = {S x , S 2 } before and after 
collision. However, since energy is conserved, we must have E = E', 
so that the dependence on E 9 does not have to be displayed. 

It has become customary to orient the frame of reference so that 
0 O = 0 o = 0, and to call the quantities 

(1.24) a(E, (o) = (NIJ 0 ) tt(E, co 0 , (o), a(E) = f <j(E, w) dco 

(where J 0 is the incident flux, i.e., the number of incident particles per 
unit time and unit beam cross section) the differential cross section and 
the total cross section , respectively. The notation da/dco instead of o(E , a>) 
is often encountered in literature. 

1.7. The Transition Operator 

In potential scattering H {2) and Hq 2) are given by (1.8) and (1.9), and 
one can easily relate o(E , co) to the S' operator. To establish this relation, 
it is computationally advantageous to express S (2) in terms of a new 
operator, 

(1.25) S< 2 > = 1 -27«T< 2 >. 

We shall refer to T (2) as a transition operator [not to be confused, how¬ 
ever, with the operators introduced in §4.8] since in the case when 
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there is no interaction present, the system stays in the same free state 
(i.e., there are no “transitions”), and the operator T {2) is zero. We 
easily establish that T {2) = 0 when there is no interaction by noting 
that the “absence of interaction” means that H {2) = Hq 2) and therefore 
£><*>(<) = 1, i.e., 

Qf = Q (2) = S (2) = 1. 

We have seen already in §7 of Chapter II that in the absence of external 
forces the Schroedinger operator H$ 2) of a two-particle system can be 
written in the form 


= y/ c (2> + //j <2) 

where H (2) is the center-of-mass energy operator and H {2) is the internal 
energy operator. The Hilbert space ^ (2) == L 2 (U Q ) of wave functions 
HR, r) is obviously the tensor product 

(1.26) ^f <2) = 3t (l) 

of the Hilbert spaces 1 = L 2 (IR 3 ) and — L 2 (R 3 ) of wavefunctions 
*/r c (R) and ^(r), respectively, where R and r are the center of mass 
variables given in (7.2) of Chapter II. It becomes evident from 
looking at the differential operators 

< u7 > -ii A+FwcH “ 

and (7.3) in Chapter II that H (2) = H™ ® 1 and H}*> = 1 ® H [ 1} , 
where H™ and i/ i (1) act in Jf ?( c 1) and respectively. 

It is natural to require that, in general and when no external forces are 
present, the free and total Hamiltonians can be written in the form 

r/(2) rr(2) i r/(2) r/(2) 1 zO\ Ej(l) 

H 0 — H c + #oi » — 1 (X) H 0 l , 

(1.28) H i2) = i/ c (2) + H[ 2 \ H[ 2) = 1 ® i/i (l) . 

Hc (2) = ® 1 

Since the operators H <2) and Hq 2 \ as well as H (2) and H {2) , obviously 
commute, we can use Theorem 2.4 of Chapter IV to infer that 

exp(— iH^t) = expexp 
exp(iH®t) = exp(iH®t) exp(iHi 2) t). 
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The above two relations are derivable directly from (2.13) in Chapter IV 
by noting that in the above case all operators are bounded and defined 
on the entire Hilbert space <^ (2) . 

From (1.28) we get 

(1.29) Q i2 \t) = exp (iH i2) t) exp(-iH^t) 

= exp^T/j 2 ^) expexp( exp(— iH$t) 

— exp(iH{ 2) t) txp(-iH^ft) 

= 1 ® exp(z//i (l) /) exp(— tHtft). 

Inserting the above expression for £} (2) (£) in (1.18) and (1.19), we easily 
arrive at the following result. 

Theorem L2* If the free and total Hamiltonians Hq 2) and H i2) of 
a two-body system can be decomposed into sums (1.28) of commuting 
Hamiltonians for center of mass and internal motion, then 

= l 0 S i2) = 1 (x) S {1 \ T i2) = 1 (x) T (l \ S (l) = 

where Q { ± ] and S a) are operators in the internal-motion Hilbert space 
^ (1) of the tensor product J^ (2) = Jf ?( c 1) (x) ^ (1) . 

The above theorem shows that in case of two-body potential scattering 
in which no external forces are involved, we can work in the Hilbert 
space ^ (1) which is related to the internal motion of the system. If 
we work in momentum space, we can introduce the relative momentum 
variables p, related to the relative position vector r. We might hope that 
the S {1) operator could be written as an integral operator in which, 
in case of spinless particles, would be of the form 

(1.30) (S (1) 0)~(P) = f 3 S (1) (P, P') ftp') dp'. 

J or 

However, we shall see in the §2 that S {1) commutes with the free 
Hamiltonian H^\K Lemma 1.2 shows that, on account of this fact, the 
representation (1.30) is not feasible. 

Lemma 1 ♦2 ♦ Let K be a bounded integral operator on L 2 (R 3 ) 
(K/H P )=f a K(p,p')/( P W, 

J OS 3 
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with a kernel K( p, p') which is Borel measurable in IR 6 . If K commutes 
with the self-adjoint operator H^W 


then K is the zero operator. 

Proof. If /(p) is a function of compact support A , then E*(A)f — 
> where E J> (A) is the spectral measure obtained as the product 
(1.13) in Chapter IV of the spectral measures of the three momentum 
component observables: 


(£ p (^)£)~(p) = Xj(p)i(p) 

^he operator H$E*(A) is obviously bounded, and commutes with K , 

([K, H^(A)]f)~( p) = J r;) K( p, p') (£--£ Xd(p)j f(p') dp' 

= 0 (almost everywhere). 

Since the above expression has to vanish almost everywhere for all 
/(p) of compact support, we have 

K(p, p') (J-- - ] = 0 (almost everywhere in H 3 X H 3 = IR 6 ). 

Now, the set of points in IR 6 for which | p | = | p' | (called the energy 
shell) is of Lebesgue measure zero. Hence, we deduce that K( p, p') 
vanishes almost everywhere in IR 6 . 

If /(p) and g( p), /, g e L 2 (IR 3 ), are bounded functions of compact 
support, then g*(p) K(p, p')/(p) is certainly integrable on IR 6 since 
K{ p,p') is Borel measurable and vanishes almost everywhere. From 
the relation 


f g^)K{p.p f )f{p f )dpdp f =0, 

we get by applying Fubini’s theorem and Theorem 4.6 in Chapter III, 
<i? | Kfy = f dpg*(p) f dp'K(p, p')/(p') = 0, 

J 05 3 J 05 3 

for all ^(IR 3 ). Since (R 3 ) is dense in L 2 (R 3 ), we get Kf = 0 

lor all / e ^(R 3 ). Thus, K is a bounded linear operator which is zero on 
a dense subset ^(R 3 ) L Z (M 3 ), and therefore K = 0. Q.E.D. 
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Let us introduce spherical coordinates ( p y co) = (p y 9 , </>), 
0 ^p < +oo, 0<^<7T, 27r, in the space [R 3 of the variables 

p. We shall see later in this chapter [cf. (4.96) and (4.97)] that the T (1) 
operator can be written as an integral operator with respect to the 
variable o>, 

(1.31) (T^Y(p y a>) = f TM(p; tO) a>') f(p, a>') du>\ da>' = sin 9 ' iff d<$>\ 

over the set Q & , defined in §7.2 of Chapter II, which is essentially 
isomorphic to the unit sphere in three dimensions. The function 
T {1) (p ; to, a/) is uniquely determined by the operator T (1) (see 
Exercise 1.6) if we restrict ourselves to continuous kernels for the 
integral operator in (1.31). 

*1.8. The T-Matrix Formula 

for the Differential Cross Section 

The differential cross section a(E, w) can be computed from the 
function T {1) (p ; to, co') which occurs in the integral representation (1.31) 
of the T operator. In order to arrive at an expression relating a(E, to) 
and T {1) (p ; to, co'), we have to express in theoretical terms those essential 
features of a scattering experiment which enter in the definition of the 
differential cross section. In any such scattering experiment, a beam of 
particles scatters from the particles of a target. The experimental setup 
is such that the relative momentum p of the incoming particles lies 
within a very narrow range of values. 

Let us introduce spherical coordinates ( p , 9 , <f>) in the p space, and 
suppose that the incoming particles are prepared in a scattering state 
!?( t ), which is such that p lies within the set 

ho = \Po 9 Po + 4Po]> ho = [00 A + M 0 ] X [*o, 

Denote by W in (t) the incoming asymptotic state of !?( t ). Suppose that 
^“(O) is represented by the function (see Exercise 1.5) 

(i-33) r(p,p)=toto, 

of the center of mass momentum P and relative momentum p. 

Let us denote by E {1> \B) the spectral measure 

(P, p) = Xs(p)$(P> p)- 


(1.34) 
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It can be shown (see Exercise 1.8) that for any Borel set Be# 
lim <¥*(*) | E m (B) = <^ ln (0) | E"»(B) W ln (0)}, 

£-»—oo 

(1.35) 

f |AP,p)I^Prfp=f I *A«(P)I 2 dp- 

J U 3 XB J B 

Since the support of the function <£(P, p; t) representing W{t) is within 
R 3 X <? <0 >, we deduce from the above relation that 

f . 1 ^» (p)|2 dp = °- 

j r 3 -c (0) 

Hence, $ 0 (p) vanishes almost everywhere outside C {0) . 

Denote by the outgoing asymptotic state of ¥*( t ). It is easy to 

show (see Exercise 1.4) that 

(1.36) ^(0) = S (2 V in (0) = (1 X S {1) ) ^ ln (0). 

Hence, by applying the same type of argument which led to (1.34) 
to we obtain 

fim <W(t) | E ( *\B) *P(f)> = <Y° U \0) | E ( *\B) ^ out (0)> 

= f . lf U ‘(P, p )| 2 dP dp 

J u 3 xb 

= f i(s tt, «"(p)r^p- 

J B 

Consequently, the probability P(/ —> I<» Q ) of finding that the relative 

momentum p of the system, a long time t after the scattering had taken 
place, is within the cone 

c (+) = {p: {0, </>) G1^}, 1^ = [O 0 ’ 9 <y + AO o'] x [<f> 0 \ <f >0 + am 

is given by (see also Exercise 1.12) 

(1.37) P(4 0 - V) = J c|+) \{S^o)~(p)\ 2 dp. 

We shall see in the next section that S {1) commutes with E H oi(B) 
(see Theorem 2.3), where 

p) = X (p , / 2 moSB} (p)^(p). 
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It follows that for the complement e^S 1 of the set {/> 2 /2m: p e/ p J, 
we have 


E h ^\b^ 5<!>0o = S™E h *\b x ) +0 = 0 

since EKIXBj) i(j 0 = 0 because $ 0 (p) vanishes outside <? (0) . This means 
that (5 (1) */r 0 J v ( p) vanishes when /> 2 /2m e , i.e., when | p | $I Pq . There¬ 
fore, if the integral in (1.37) is expressed in spherical coordinates, the 
integration in p extends in effect only over the set I Po . Thus, we can 
write 


(1.38) P(/„ 0 - V) = f ,d* P f P 2 dpj do,' | ^"‘(P, p)|* 

01 J 7 J»n S' 


= J f |(5<^ 0 Hp)|*. 


Let us consider the case when the direction of relative motion of the 
two particles in the system has changed as a result of interaction, i.e., 
when cu 0 ^ ojq and / ft>o n / Wo , = 0. If we use the definition (1.25) of 
the transition operator to write 


S {1 Vo = - 2 tHT ^ 0 


and recall that $ 0 '(p, co) = $ 0 (p) vanishes when co is outside I (0q , 
we arrive at the result 


(1.39) P(I W0 -1 *) = 4*r 2 f p*dp f dJ \(T«y o n P )\*. 

J S J S' 


Hence, we get for the probability density P(a> 0 ') (per unit solid angle) of 
scattering in the direction o> 0 ' 

(1.40) PM = lim 4^/ \(T^o)~(p, o^YP'dp, 

V ' V 0/ 4Jw'-0 Joi J i pQ 

provided that (T {1 ) i/jqY\ p y a/) is continuous in a/ in some neighborhood 
of cdq. Substituting in the above relation for the operator T a) its integral 
representation (1.31), we obtain 

PM =47T* p*dpff do, x do, 2 TV*(Pi o > 0 ^) 

X r tt, (p; <o 0 ', to a ) fo*{p, 0, x ) &(/>, <°»). 


(1.41) 
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where the integrations in and co 2 extend over the unit sphere in three 
dimensions. The request that (T {1) ifj Q ')~(p t to') be continuous can be met 
by demanding that T {1) (p ; a/, w) be continuous in a/. 

At this stage one must remember that the expression (1.41) is the 
probability density of scattering of two particles in the relative direction 
aj 0 ' with respect to one another. We have already indicated, however, 
that in a scattering experiment a beam of particles is being scattered on 
a target consisting of a large number of particles. Thus, in a scattering 
experiment, a great many two-body scatterings are observed simul¬ 
taneously. These scatterings between particles in the beam and particles 
in the target are taking place at different locations with respect to the 
laboratory frame of reference. Hence, in order to arrive at an expression 
for a planar differential cross section, an averaging of P(oj 0 ') over all these 
two-body scatterings at different locations has to be carried out. 

If a particle © 2 " in the target is at a location r 0 in relation to another 
particle S 2 ', while © 2 ' is in position r with respect to a particle in 
the beam, then © 2 " is at the location r + r 0 in relation to the incident 
particle © x . Since, with the exception of location, all the other variables 
of © 2 ' and © 2 " are the same, we conclude that if 


0c(R) </'o( r )» 


m 1 r t + tn 2 r 2 
m 1 + m 2 


r = r 2 — r ! , 


represents ¥ /in (0) for (2^ , S 2 '}, the wave function 


iMR) K(r) = *( R + ^ r °) Mr + r » ) 

represents ^“(O) for {Sj, S 2 *}- taking Fourier transforms, for 
{®i , ® 2 ,/ } we easily get 

^r 0 (p) = exp( tpr 0 ) I?„(p), 


where 0o(p) refers to {Sj, ® 2 '}, and $ rQ (p) refers to {Sj, © 2 "}- Hence, 
the probability P ro (aj 0 ') for {©j , S 2 "} is 


(1.42) P l0 K') = 4^ f dpp*j dw, dco 2 T*(p ; o» 0 ', oj,) 


X T(p; «V, <o 2 ) exp[j'r 0 (p 2 - p x )] <p n *(Pi)>J’o (Pa)!^-* 


instead of (1.41). 

We shall now make the assumption that the scatterers in the target 
are uniformly distributed, so that A^ 0 scatterers interact with the incident 
particles per unit target volume and unit time. Then the total number 



434 


V* Quantum Mechanical Scattering Theory 


of particles scattered* per unit time and solid angle in the direction co 0 ' 
is on the average equal to 

(1.43) 

K«o') = -Nof P '«M dr n 

J y 

= 4t t 2 N 0 J ^dr 0 J dpp 2 j do h dcu 2 exp[«r 0 (p 2 - p x )] T (1) * (p; o' 0 .to,) 
X T a) (p; co 0 ', a> 2 ) <Ao*(Pi) *?o(P2)Id 1 -j> 2 = j > » 


assuming that the scatterers in the target are sufficiently dense that we 
may, with sufficient accuracy, replace the sum over the individual 
scatterers with an integral over the volume V of the target. 

We shall evaluate the integral (1.43) under the assumption that the 
target is a slab of thickness 8 , which extends to infinity in the directions 
x Q and y 0 (see Fig. 5). Setting k = p 2 — Pi and reversing the orders of 
integration in ( 1 . 43 ), we obtain 

(1.44) v(oj 0 ') = 477 2 iV 0 j* dcxj x j dr 0 jdp duj 2 p 2 e tkT « F(oj 0 '; p, , cu 2 ), 

where we have introduced the function 


(1.45) F(u>q\p, , OJ 2 ) 

= T { 1 ) *(p\ aV, ^i) T { 1 ) (p\ a V, W 2 ) $£*(/>, cjjfaip, c«i 2 ). 

Since I Pi I = I p 2 I — P> only two of the components of k are 
independent. We can substitute for co 2 the variable w which represents 
the spherical coordinates 6 and <j> of p 2 in a coordinate system with its 
z axis in the direction of p x . Hence, p lx — p 2a . = 0 and 

K = p 2 x = P sin 0 cos<£, 
ky = p 2 y — P si n 0 si n <£• 


By substituting for co = ( 6 , </>) the variables k x and k y , we obtain 


p 2 dp dou = p 2 sin d dp dO d(f> = p 2 sin 6 ^ 7 ^ —dp dk x dk y = dp dk x dk y . 


d(k x , k y ) 


* In practice, the scatterers in the target are approximately at rest with respect to the 
laboratory frame, so that the relative scattering direction w 0 ' computed in the center-of- 
mass frame is very often almost equal to the scattering angle measured in the laboratory 
reference frame. 
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Using the properties of Fourier transforms applied to the variables 
k x , k y and their conjugates x 0 and y 0 , we obtain 

J r2 dx o dy 0 J dk x dk y e !r ° k J dp />, co t , a> 2 ) 

= 4^2 f ^1^2) d 

J A. cos $ =fc..=n 


Since obviously dy 0 dz 0 = dr 0 , the above relation can be used in 
computing the integral in (1.44). When k x = k y = 0, we have fa = fa 
and = 0 2 or ^ = fa and = 77 — 0 2 . Since F(o> 0 '; p, , co 2 ) 
vanishes when or a > 2 are outside 1^ , it follows that when / ajQ is 
sufficiently small only the first alternative fa = fa and — 0 2 yields 
a nonzero result. This means that k z = 0, 0 — 0, and the integration 
in z 0 in (1.44) can be immediately performed: 


(1 .46) K<V) = ( 2 ? 0 4 N o& J doj i j dpF(u> o'; p , , oji) 

S> J 7 *o 

= (27t ) 4 7V 0 S f ^ f | £ (/) , Wl )|2 p 2 ^. 

J 7 *> 0 J 7 *»o P 


We note that iV 0 8 = iV x is the total number of scatterings per unit time 
and target area, i.e., the incident flux J 0 . Let us choose Ap 0 , A0 o , and 
Afa very small, and the direction of the incident beam along the z axis, 
so that Sq — fa = 0. Then, in the limit 


Ap 0 , A0 O , Afa -* +0, | <£ 0 (p)l 2 -> 6 3 (p - p 0 ), 


we get by the mean-value theorem of the integral calculus 


lim v(a, 0 ') = /„ | T m (p\ <o 0 \ w 0 ) 

f 0 l 2 ->« 3 P 


= ( 0 » 0 ) 


We note that the application of the mean-value theorem is warranted 
on account of the implict assumption that T {1) (p ; a/, a>) is continuous 
in some neighborhood of a/ — ( 0 , 0 ). 

Since v(oj 0 ') is the number of scatterings off an infinite planar* target 
(cf. Fig. 5) in the direction co 0 ' per unit time, we have by (1.24) 


<j(E, <« 0 ') = 4- lim v(co 0 '). 

Jo l^ol 2 **® 3 

# By qualifying the above differential cross section as “planar” we depart from con¬ 
ventional terminology, which in the quantum context recognizes only the cross section 
given by Eq. (1.47) or, equivalently, by its more standard counterparts (4.98) and (4.99) 
(see §4.10 for a discussion of this point). 
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Thus, we have obtained the expression 


(1.47) o(E ,«) = | «, «o)|*, 

Po 


E = («1 + Wg) j>0» 


for the planar differential scattering cross section under the assumption 
that T {1) (p ; co/, c^) is continuous in co/ and co x when co/ is in a neigh¬ 
borhood of co' and co x in a neighborhood of co 0 = ( 0 , 0 ) for a given 
P G [0, 00). 


Exercises 

LL Let r| x + v| x £, & = 1, 2, be the asymptotic states for the state 
r k {t ), & — 1, 2, of a classical two-particle system. In the center of mass 
coordinates (7.2) of Chapter II the state of the system is given by R(£) 
and r(£), where R(Z) = R c + V c t describes the uniform motion of the 
center of mass of the system. Show that if r ex = r| x rf x , 
v ex = v| x — v® x , then | r(£) — (r ex + v ex t)\ -> 0, (l/t) r(£) -> v ex and 
| r(£)| — t | v ex | -> r ex • v ex /| v ex | when t -> Too, respectively. 

L2* Prove that a system of two particles interacting via a potential 
V(r) = 1 \r (gravitational or Coulomb potential) does not possess any 
asymptotic states satisfying (1.5) and (1.6). 

L3* Show that the adjoint K * of a bounded integral operator K , 
(Kf)(x) = f K(x,x')f(x')d”x', 

defined on L 2 ([R n ), is an integral operator of the form 

(K*g%x)= f (K(x',x))*g(x')d”x'. 

J n« n 

L4* Prove that if ¥ fex (^) e M 0 (ex = in, out) are the asymptotic 
states of x ¥{t), and S is the scattering operator, then ^^(O) = *S'¥ /in (0). 

L5* Show that the formula (1.47) for the scattering cross section 
stays valid in the general case when ^“(O) is represented by an arbitrary 
wave function sHp.p). which is not necessarily in the form of the 
product occurring in ( 1 . 33 ). 

L 6 * Prove that if T a) (p ; co, co') and T^\p\ co, co') are two kernels 
satisfying (1.31) for a given operator T a) , then 3 n j ) 1) (/>; co, a>') — 
T (1) (p; co, co') almost everywhere with respect to the measure 
li™ X I* p X 5 on [0, + oo) X Q v X Q v . 
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L7* Suppose ¥*( t) and (£), t e U\ are vector-valued functions 
assuming values in the Hilbert space and that there is a constant C 
such that || < C and || ¥ f 0 (^)|| ^ C for all t e U 1 . Let A be a bounded 

operator on J#*. Show that lim HToo || } F(t) — ^(Oll ~ 0 implies that 

lim [<¥^(0 | AW(t)) - (W 0 (t) | AW 0 (t))] = 0. 


L8* Use the result of Exercise 1.7 to prove that 

lim (W(t) | E™(B) ^(O) = <^ex(0) | E*\B) ^(O)), 

t-> ^f^co 

where ( Hq 2) i/>)~(P, p) = (P 2 /2M 0 + p 2 /2m 0 ) ^(P, p), with M 0 and m 0 
denoting, respectively, the total and the reduced mass of the system. 

L9* Consider the case of one-particle wave mechanics, when 
/7 0 D (l/2m)A. Using the well-known relation 

(exp(— (k) = exp[—i(k 2 l2m)t] k) 

derive that, for functions ifs( r) which are Lebesgue integrable as well as 
square integrable, 

(exp(— iH 0 t)ijj)(r) = (m\2Triif 12 f exp[z(/ra/2J)(r — r') 2 ] 0(r') dr' 
almost everywhere in U 3 . 

L10* Verify that the operators t/j 0 and Uf \ defined for real t ^ 0 
by 

(U^tft)( r) = (mlit) 312 exp[im(r 2 l2t)] ijj(mrlt), 

(t/ t <2 V)(r) = exp[ im(r 2 l2t)] ^(r), 

are unitary. Use the result of the preceding exercise to show that for 
// () 2 — (l/2m)A 

e~ iHot = Ul x) Ul 2) . 

1.1 L Use the result of Exercise 1.10 to prove that the operator 
families and C/| 1) defined there satisfy 

Hm Wie-^- C/ t (l V 11=0 


for all i/j gL 2 (R 3 ). 
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1*12♦ Consider the cones 


C (+) = {(r, 6,4): r > 0, 0 O < 0 < 0 O + + A<j>} 

and C (_) = (r: —r e C+) in R 3 . Use the results of Exercises 1.7 and 1.11 
to prove that for H 0 D —(1 /2m)A 

lim f |(«- ur °V)(r)| , *= f |ftk)|«A. 

HT® J c ( +) J C ( T) 

Remark. This result indicates that asymptotically in the future the 
probability of finding the particle spatially located in the cone C (+) 
is equal to the probability of finding its momentum within C (+) . 

2 ♦ General Time-Dependent 
Two-Body Scattering Theory 

2.1. The Intertwining Property of Wave Operators 

In §1 we defined the Moller wave operators Q ± by means of strong 
limits of Q{t) y 

Q± = s-lim Q(t) Em , 

t->Too w ± 

( 2 . 1 ) 

Q(t) = e iHt e~ iH »\ H = *, H 0 = if 0 *, 

in the time parameter t . In this definition, the closed linear subspaces 
M ± of are spanned by all vectors/ which are such that / is a free 
state and the respective strong limits s-lim^ Too Q{t)f exist.* The version 
of scattering theory based on this procedure of computing the wave 
operators, and eventually the scattering operator 

(2.2) 5 = Q_*Q + , 

is called time-dependent scattering theory. 

In this section we shall derive some of the basic properties of wave 
and scattering operators, which follow from the general definitions (2.1) 
and (2.2), and which do not require the specification of H 0 and H within 


* A mathematically common (but physically unfounded) way of defining Q± features 
instead of in (2.1), where S^ c is the absolutely continuous spectrum of H 
(in practice, , though in general C S^). Since, in general, the strong limit 

of Q(t)E H ( S^) for t -> oo might not exist, this alternative definition gives rise to the 
question of existence of wave operators. In our version, the wave operators always exist, 
but one must check in each particular case on the size of M* and (see §§7.1-7.2). 
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the confines of some particular model. It can be easily seen by comparing 
( 2 . 1 ), ( 2 . 2 ) with (1.18), (1.28), and (1.29) that the derived theorems apply 
to the operators Q {2) and S {2) as well as to their counterparts and S {1) 
in the “internal motion” Hilbert space Naturally, in the first 

case one should take H = H {2) and H 0 = H^ 2) , while in the second case 
one should take H 0 = and H = H a K 

Theorem 2*1* The wave operators Q ± have the intertwining 
properties 

(2.3) Q ± e itH » = e itH Q ± , teU\ 

(2.4) Q ± E h \B) = E h {B) D ±9 Be 
If fe @ Ho , then 

(2.5) Q±H 0 f = HQ±f. 

Proof. If f x belongs to the respective subspaces M ± , then according 
to the definition of M ± , the respective strong limits of &{tf)f x exist 
when t 1 —> =poo. Thus, we can write 

|| exp (iHt) D±f 1 - s-limfexp^i) exp (-iH^)] exp(*77 0 *)/i || 

= lim || exp (iHt) Q±f 1 — (exp(^) exp(-z7/ 0 ^)) exp(tff 0 f)/i || 

= lim || exp (iHt) &±f x — exp (iHt) exp [iH(t x — £)] exp [—iH 0 (t x — ^)]/ x || 

= II Q ± f x - exp[*#(<! — <)] exp— t)\f 1 1 | = 0. 


Hence, cxp(iH 0 t) also belongs to M ± , and since 

s-lim exp (iHt x ) exp (—iH Q t^) ^xp{iH 0 t)f x = D± , 


we have proved that 

Q ± exp(*77 0 *)/i = exp (iHt) Q±f 1 , A e M ± , 

for all t e R 1 . 

Now, if / 2 e © M ± , exp (iH 0 t)f 2 also belongs to . In 

fact, for all f x e M ± 

<exp(iW 0 0/i l/i> = </a I exp(-i7y 0 f)/i> = 0 , 
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since exp (iH 0 t) f x e M ± according to the preceding discussion. Thus, due 
to definition (2.1) of Q ± , we have Q ± f 2 = 0 and Q ± exp (iH 0 t) f 2 = 0. 
Consequently 

(2.6) Q± exp (iH 0 t)f 2 = exp (iHt) Q±f 2 = 0, f 2 e M± . 

Since every fe can be decomposed in the form 

/=/i+/«. A = V eMi - / 2 = v eM i> 

we immediately obtain (2.3) from (2.5) and (2.6). 

The relation (2.4) can be easily derived from (2.3) by methods 
frequently employed in Chapter IV (see Exercise 2.1). 

Finally, by making use of the result (3.6) in Theorem 3.1, Chapter IV, 
and of the fact that Q ± are bounded and therefore continuous, we easily 
derive that for fe ^Hq 

®±Hof = &± s-lim 1 (exp(iH 0 t) — 1)/ 

— s-lim ~ ( D± exp (iH 0 t) — D ± ) f 

= s-lim (exp (iHt) Q± — 0±)f 

== s-ljm ~ (exp (iHt) - 1) Q ± f = HQ±f. Q.E.D. 

2.2. The Partial Isometry of Wave Operators 

We recall that in the last section M ± were defined to be the sets of all 
vectors feJ^Q for which the respective limits 

/ ± = s-Hmr2(0/ 


exist, while R ± are the sets of all vectors f ± . According to Lemma 1.1, 
the sets M ± are a closed linear subspace of Jtf*. We shall prove the same 
statement for R ± , by showing that Q ± are partially isometric operators. 

Definition 2.1. A bounded linear operator U defined on the 
Hilbert space is said to be partially isometric with initial domain M 
and final domain N if U maps M isometrically onto N and maps 
M x = je 0 M into the zero vector, i.e., if || Uf\\ — ||/|| for fe M and 
|| Uf || = Ofor/E © M. 
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Theorem 2*2. The ranges' of Q ± are closed linear subspaces of Jf 7 ; 
the wave operators Q ± are partially isometric with initial domains M ± 
and final domains R ± , respectively. 

Proof. According to the definition of R + , any vector f + e R + is the 
strong limit 

f+ = s-lim Q(t)f = Q + f 

for some vector fe M + . Moreover, since Q(t) is unitary, 

(2.7) \\Q + f\\=\im ||^(0/ll=ll/ll 

if/eM + . Hence, the restriction of Q + to M + has an inverse , 
which is bounded on account of (2.7). 

To prove that R + is closed, let h e be the strong limit of a sequence 
h x , h 2 ,... e R + . By (2.7), f x = , / 2 = 0^h 2 ,... is a Cauchy 

sequence of elements from M + . Since M + is closed, /! ,/ 2 has a limit 
fe M + . Therefore, h = ^ + /e R + , which proves that R + is a closed set. 

The linearity of R + is an obvious consequence of the linearity of . 
Thus, R + is a closed linear subspace of J^,and Q + maps M + isometrically 
onto R + . Moreover, if g _L M + , then 

Q+g = s-lim Q(t) E M+ g = 0. 

Thus, Q + is partially isometric with initial domain M + and final 
domain R + . 

The corresponding result for R_ is established in a similar 
manner. Q.E.D. 

Lemma 2*1* If U is a partially isometric operator on with 
initial domain M and final domain N, then £/* is a partial isometric 
operator with initial domain N and final domain M, and the following 
relations are true: 

(2.8) U*U = £ m , 

(2.9) UU* = E n . 

Proof. If g _L M, then Ug = 0 and consequently U*Ug = 0. 
Hence, life M, then 

<g\U*Uf'> = <U*Ug\f'>=0 

for all g e M 1 , and therefore U*Ufe M. Since M is linear, we also have 
/ — U*Ufe M for any fe M. On the other hand, 

<* 1/ - U*Uf> = (h |/> - <Uh | Uf> = 0 
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for all hyfeM (see Exercise 2.2), which implies that/ = U*Uf. Thus, 
U*Uf = f for all/e M and U*Ug = 0 for all g J_ M, i.e., £/*£/ = E M 
and (2.8) is established. 

We shall prove now that £7* is partially isometric. If h e N, there is 
a unique fe M such that h = Uf, ||/|| = || h ||. Consequently, 
U*h = U*Uf = f by (2.8). On the other hand, if h x _L N, then 
<[/*/?! l/i) = | [//*!> = 0 for all f x e Jt?, since Uf x e N always. 

Consequently, £7*/^ = 0 whenever h x J_ N. Thus, £7* is partially 
isometric with initial domain N and final domain M. 

By reversing the roles of £7 and £7* in (2.8) we show that (2.9) is 
true. Q.E.D. 

If we apply this lemma to the operators Q ± , we arrive at the following 
relations: 

(2.10) Q ± *Q ± = E m±9 

(2.11) Q±Q±*=E r± . 

In particular, if M + = M_ = M 0 = Jf 7 , we would have Q ± *Q ± = 1. 
We shall see in later sections that in problems of physical interest 
Mj) 2) coincides with the subspace J^ {2) © Jf^ 2) of all vectors which are 
orthogonal to the subspace = Jf ?( c 1) ® ^ob> where [see (1.26)] 

•2fob C ^ (1) is the space spanned by the eigenvectors of the “internal” 
free Hamiltonian Hq\ ] given in (1.28). Ordinarily, Hq} ] has a pure 
continuous spectrum, in which case Jf^ 2) = {0} and M{> 2) = Jf’ (2) . 

2.3. Properties of the S' Operator 

We turn our attention now to the S operator, giving special attention 
to the practically very important property of its “unitarity.” 

Theorem 23 ♦ The S operator commutes with the free Hamiltonian 
H 0 , and ¥ /out (^) = SW in (t) if W G *(t), ex = in, out, are the strong asymp¬ 
totic states of some W(t) = e iHt W(0 ). 

Proof ‘ Let E H <>(B) be the spectral measure of H 0 . Using the inter¬ 
twining property (2.4) and its equivalent 

(2.12) Q±*E h {B) = E h \B) Q±*, B e 
derived from (2.4) by taking adjoints, we obtain 

E h °(B)S = E h °(B)Q_*Q + = Q_*E h (B)Q+ = Q_*Q + E h \B) = SE h \B ). 
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Thus, S commutes with H 0 . Since by (2.1) [cf. (1.14)], ¥*(0) = fi ± y«(0) 
so that ¥^(0) — R + n R_ , we get = *S'¥ /in (^) at t = 0 by using 

(2.11). By virtue of [ S , H 0 ] = 0, this stays true for all t e R 1 . Q.E.D. 

We recall that Theorem 2.3 has been used in proving that (1.30) is 
not feasible. In Theorems 2.4 and 2.5 we show that 5*5 = E Mo if 
M 0 = M ± . Thus, the condition that 5 ^ 0 coincides with the condition that 
M 0 ^ {0}, i.e., that there are asymptotic states and that the wave 
operators do not vanish. 

Theorem 2*4* The S operator maps the subspace M + into M_, 
and maps its orthogonal complement © M + into zero; moreover it 
maps M + isometrically into M_ (i.e., || Sf\\ = ||/|| for all fe M + ), if and 
only if R + C R_ . 

Proof ’. According to Lemma 2.1, the final domain of Q _* is identical 
to the initial domain M_ of Q_ . Therefore, Sf = Q_*(Q + f) e M_ for 
all / g Jf?. However, if/ _]_ M + , then Q + f = 0 and consequently Sf = 0, 
which establishes the first statement in the theorem. 

According to Theorem 2.2, Q+ maps M + isometrically onto R + , 
i.e., ||jQ + £|| = \\g\\ for ^eM + . By Lemma 2.1, £?_* maps R_ iso¬ 
metrically onto M_ . Thus, we have || Q_*Q+g || = || Q+g || = || g || 
for all g g M + if R + C R_ . However, if R + © (R_ n R + ) ^ {0}, then 
the elements of R + © (R_ n R + ) are mapped by Q_* into zero, and 
S = QJ*Q+ is not isometric on M + . Q.E.D. 

The above theorem indicates that the S operator might be unitary if 
M + = M_ — M 0 and S is restricted to M 0 . Theorem 2.5 gives an 
important necessary and sufficient condition for this case to occur. 

Theorem 2*5* The S operator is partially isometric with initial 
domain M + and final domain M_ and satisfies the relations 

(2.13) 5*5 = £ m+< 55* = £ m _ 

if and only if R + = R_ . 

Proof. Assume that R + = R_ . Using 

Er ± &± = Q ± , 

and the identity (2.11), we obtain that (2.13) holds: 

5*5 = Q_*Q_Q„*Q+ = Q+*E k _Q+ = Q + *E R+ Q + = = £ M+ , 

55* = = Q_*E r Q„ = Q_*E K _Q_ = Q_*Q_ = . 
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Now, by Theorem 2.4, when R_ = R + , the S operator is partially iso¬ 
metric, with initial domain M + and a final domain equal to its 
range 9t s • Since, according to the same theorem, Sts C M_ , we obtain 
by using (2.13) and Lemma 2.1 that St s = M_ . 

Conversely, assume that the relations (2.13) are true. We get by 
expressing in (2.13) S'*, S', and in terms of the wave operators 
Q ± : 

Q + *Q_Q_*Q + = Q + *Q + , Q_*Q + Q + *Q_ = Q_*Q_ . 

Multiplying the above two relations from the left by Q ± and from the 
right by £? ± *, respectively, we obtain, after using ( 2 . 10 ) and ( 2 . 11 ), 

(2.14) E R E R E R+ = E 2 R+ =E R+9 

(2.15) E r _E r+ E r _ = E 2 R _ = E r _ . 

Thus, we have for any f e 

II £r + /II 2 = </l £r + /> = </l E R+ E R _E R+ f} 


= <E K f\ E R _E R J} = || E R _E R+ f || 2 . 


The above equality implies (see Exercise 2.3) that E R f = E R _E R f for 
all/e i.e., 

Er_E r+ = E r+ = E r+ = E r+ E r . 

Inserting the above relation in (2.15), we get 

E r = E r _E r+ E r _ = E r _E r+ = E r _E r+ = E r+ . Q.E.D. 

In the case of two-body potential scattering, i.e., scattering theory 
in which H and H 0 are essentially of the form (1.8) and (1.9), we will be 
able to prove that indeed M + = M_ = M 0 and R + = R_ when the 
potential V{t) fulfills certain conditions. However, it is still interesting 
to investigate the physical consequences of a theory in which R + ^ R_ , 
i.e., when the S operator is not a partial isometry from M + to M_ . 

Let us assume that £ 0 is a vector which belongs to R + but does not 
belong to R_ . According to the definition of R + , the fact that £ 0 e R + 
implies that there is a vector £ in e M + such that £ 0 = &+gm » i*e., 

lim || e- im g 0 - e-W g in || = lim || g 0 - e iHt e^'g* || = 0 . 

oo £->—oo 

Hence, the interacting state e~ iHi g 0 has an incoming asymptotic free 
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state. This means that e~ iHt g 0 is not a bound state. However, since 
go $ R- > 

s-lim e iHt e~ iHat g 0 

t~>+cO 60 

does not exist. Hence, by Theorem 1.1 e~ iHt g 0 h 2 isno outgoing asymp¬ 
totic state. Thus, e~ iHt g 0 is a “quasi-bound” state, which is approximately 
a free state in the distant past, but which does not become a free state 
again at any time in the future (see Pearson [1975] for examples). 

On the other hand, if there is a vector g x e R_ which does not belong 
to R + , then the theory possesses an interacting state e~ iHt g x which 
becomes a free state in the distant future, but wich has never been a 
free state in the past. Thus, the condition of “unitarity” of the S operator, 
which is usually imposed on scattering theories, is equivalent to the 
requirement of not having any quasi-bound states in the theory. 

2.4. Initial and Final Domains of Wave Operators 

Thus far in this section we have never used the basic assumption that 
a state e~ iHo 1 g is called a free state only if it is not a bound state of the 
Hamiltonian H 0 . As a matter of fact, all the theorems of this section 
stay valid if we define M ± to be the sets of all vectors fe 34? for which 
the respective strong limits of Q{t)f , t -> T- °o, exist. If M ± are defined 
this way and we choose H identical to H 0 , we would have Q(t) = 1 
and M ± = M 0 = 34?, always. However, if H 0 has bound states, then not 
all e~ iH o 1 f, fe34?, are free states, and M 0 defined in §1 is a nontrivial 
subspace of 34?. 

Let M x = E H (S I f)34? denote the closed subspace of 34? onto which the 
spectral measure E H ( Sf) belonging to the continuous spectrum of H 
projects. 

Theorem 2*6* The closed subspaces R ± of 34? are also subspaces 
of M x — E h (Sq)34?, and are left invariant by E H (B), i.e., E H (B)fe R ± 
for all B e 3S 1 whenever / eR ± . 

Proof. Since R ± are the ranges of Q ± , we can find for any g ± e R ± 
vectors f ± , such that g ± = Q ± f ± . Hence, using (2.4), we get 

E h (B) g ± = Q±E H °(B)f± 6 R± , Bern, 

which establishes that R ± are left invariant by H. 

We shall prove that R ± C M x by showing that Q ± f ± is orthogonal to 
all the eigenvectors of H. 
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Let g be an eigenvector of H with the eigenvalue A 0 , so that 
E H ({h 0 }) g = g- Given any f x e M ± , we can write 

<£ I a±f^> = <E H m)g I ^±/i> - <n±*E H ({\ 0 }) g I A) 

= <E H \{\})Q ± *g |/i> = <g\ Q±E H \{X n })f i y = o, 

where we have made use of ( 2 . 12 ) to derive the third equality; the last 
inner product vanishes because E H »({ A 0 }) f x = 0, since no f x e M ± is an 
eigenvector of H 0 . 

Since Q±f 2 = 0 whenever/ 2 J_ M ± , we derive, by setting/ = f ± + / 2 , 
A 6 M ± , / 2 _L M ± , 

<£ I a±f> = <g I fl±/i> + (g I fid=A> = o 

for any fe Hence, R ± are orthogonal to any eigenvectors 
gofH. Q.E.D. 

We will be able to prove in later sections that in potential scattering 
with certain classes of potentials, we have R + = R_ = M x . In order to 
arrive at results of this kind, one usually starts by proving first that 
R + = R_ . This establishes immediately that the S operator is unitary. 
Now we shall prove a theorem which states a sufficient condition for the 
identity of R + and R_ . This theorem applies to operators with a simple 
continuous spectrum. 

The concept of one operator A with a simple spectrum was introduced 
in §5 of Chapter IV: the self-adjoint operator A has a simple spectrum 
if the one-element set {^4} is a complete set of operators. According to 
the theorems of §5 in Chapter IV it is equivalent to say that A has 
a simple spectrum if there is a vector / 0 which is cyclic with respect to A . 

If and Sp denote the continuous and point spectrum of A , then the 
closed subspaces J^ A = E A ( S A )J^ and Jt? A = E A { S p )Jf onto which 
Z^S^) and E^(Sp) project, respectively, are left invariant by A. To see 
that, note that ® #P A and that 

<J\A£> = f^<J\Ei g > =0 

J —00 

if/e J^ A and g e #P A n Of A , or if / e J^ A and g e n . 

The restrictions A 0 and A p of A to J^ c and , respectively, are 
obviously self-adjoint operators with spectral resolutions 

(2.16) A„ = j A dEf , E\B) = E A (B n Si), 

(2.17) A v = J A dEfc, E\B) = E A (B n S£), 
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respectively. When A c or A p are operators with a simple spectrum, we 
say that A has a simple continuous spectrum or a simple point spectrum , 
respectively. It is easy to establish that A has a simple spectrum if and 
only if both its continuous spectrum and its point spectrum are simple 
(see Exercise 2.4). 

*Lemma 2*2* Let i be a self-adjoint operator with a simple 
spectrum, and let M*., k = 1, 2, be invariant subspaces of A , i.e., for 
any f e M k n @ A we have Afe , k = 1, 2. If the restrictions A k , 
k == 1, 2, of A to Mfc have identical spectra, then M x == M 2 . 

Proof . Since M*. is left invariant by A , the operators A and E Mfc 
commute (see Exercise 2.5). Hence, by Theorem 5.6 of Chapter IV, 
the projector E Mfc is a function of A , i.e., E Mfc = F k (A ). From the 
properties F M ^ = F$ fc and E ^ = F Mfc , by using (2.26) of Chapter IV 
we prove that F k ( A) is a real function satisfying 

f /M d || Efff = f F*(A) d || Eif\\\ fe 
•'ds 1 •'r 

and therefore -F^A) can assume only the values 0 and 1 for A e S A . Thus, 
if we setF k (X) = 0 for A ^ S A , we can state thatF^A) is the characteristic 
function of a Borel subset B k of S A . 

We obviously have 

A k = AEtf k = AF k (A). 

Hence, the spectrum S Ajc of A k coincides with B k . Consequently, 
S^i = S ^2 implies that F X {A) = F 2 (A), i.e., F Mi = F Mg . Q.E.D. 

The following theorem can be very useful in establishing that S is 
unitary. 

Theorem 2.7. If the continuous spectrum of H is simple and 
M+ = M_ , then R + ~R_. 

Proof . Let H 1 be the restriction of H to the closed subspace 
M x = E h (Sq)3# ? . The operator H ± is obviously a self-adjoint operator 
with a pure continuous spectrum which is identical to the continuous 
spectrum of H. Hence, according to the assumption, the spectrum 
S ,f » of H x is simple. 

According to Theorem 2.6, R ± are closed subspaces of M x which H 1 
leaves invariant. Since the spectrum of H x is simple, by (2.4) and (2.11) 
we can apply Lemma 2.2 and infer that R + == R_ . Q.E.D. 
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2.5. Dyson’s Perturbation Expansion 

We saw in §1 that the knowledge of the S operator is sufficient for the 
calculation of the differential cross section and of transition probabilities. 
This shows that a complete solution of the dynamical problem, which 
would consist in computing the evolution operator U(t , t 0 ) introduced 
in §3 of Chapter IV, is not necessary for the theoretical description of 
a scattering experiment. Since the computation of W(t) — U(t , t 0 ) *F 0 
for given can be a difficult and intricate computational task, it is 
desirable to develop methods for computing the S operator in a direct 
manner. Dyson’s perturbation theory provides one such method. 

It follows from (2.1) that for any/, g e 

(2.18) </1 | £> = lim <fl(0 E„J \ g > = lim </1 E n Q*(t) g>. 

£-> + <» t->+CO 

Since Q*(t) is unitary, we have || £?*(*)£ || = WgW* On the other hand, 
if g g R_ , it follows from the fact that Q _* is a partially isometric 
operator with initial domain R_ (see Theorem 2.2 and Lemma 2.1) 
that || Q-*g || = H^ll, so that || Q*(t)g || = || Q_*g ||. According to 
Lemma 6.2 of Chapter IV, this result in conjunction with (2.18) implies 
that ^ 

Q_* g = s-lim £ M J2*(0 g, g e R_ • 

£-»-fao 

Hence, if S' is partially isometric, so that R + == R_ , we have 

(2.19) S = £ M s-lim f2*(J)(s-lim Q(t 0 ) £ M ) 

£-*+°° to - *— 00 + 

= £ m s-lim(s-lim Q*(t) Q(t 0 )) Em , 

£-»+°o £ 0 ~»—oo + 

where the operator 

(2.20) V(t, t 0 ) = Q*(t) Q(t a ) = e iH ^ e ~ iH(t ~ h) e" 41 *^ 

is obviously unitary for any t , t {) e U 1 . 

Comparison of 

Q*(t) = e iHot e~ iHt 

with (3.43) of Chapter IV shows that £?*(*) is the operator which deter¬ 
mines the time dependence of the state W(t) in the interaction picture. 
Hence, we have (see Chapter IV, Exercise 3.4) 

(2.21) —= s-lim = -iHMQ*(t)f, 

(2.22) Ih(t) = - H a ) 

for any / £ &! n n f ./i H . 
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We have seen in §7 of Chapter IV that in potential scattering, under 
very reasonable assumptions on the potential, coincides with . 
When this is the case, and the vector / belongs to ~ , we have 

exp( —f e S) H and exp (iHt 0 ) exp(— iH 0 t 0 ) f e @) h (see Exercise 
2 .6), so that Q(t 0 ) f e n 3 # Hq and 


(2.23) 


8 V(t, to ) , = s lim V(t + M, h) 
8 t J Jf-»o At 


v(t, t 0 ) 


f = -rn.it) v(t, t 0 )f. 


Thus, for any vector h e 


(2.24) (d/dtXh | V(t, t 0 )f > = -i<* | ^iW V(t, t 0 )f). 

If the function (h \ H^t) V(t , £ 0 )/> * s integrable in t , we obtain from 

(2.24) , after having observed that V(t 0 , t 0 ) = 1, the following relation: 

(2.25) <A | V(t, t 0 )f> = <* |/> - i f <* I Hi(*i) V(t,, t 0 )f> d h . 

J to 

Since <A | F(£, £ 0 )/> an d |/> are bounded functionals, 

-* ( (h | H.it i) F& , f 0 )/> dh 

is defined for any h e Jf", it determines for each fixed/ e Sj Hq a continuous 
antilinear functional in A, and therefore by Riesz’s Theorem (Theorem 
2.3, Chapter III), it defines a mapping/ h-> W^t, t 0 )f. It is easy to see that 
W^ty t 0 ) is a linear operator defined on Sj Hq . We write, symbolically, 

(2.26) WJt, to) = -i f Hfa) V(t x , to) dt x . 

J to 

In this notation (2.25) is equivalent to the statement that 

(2.27) V(t, t 0 )f = (l - * J* Hfa) V(h , t 0 ) dty) f, f e . 

If we treat the above integral equation recursively in V(t f t 0 ) f we obtain 
after n steps 

V(ty t 0 )f = [1 + Vi(t, to) + - + to) + w n (t, t 0 )]f, fe® Ho , 

where V k (t y t Q ) and W n (t, t 0 ) are defined by the relations 

(2.28) V k (ty t 0 ) = (-*)* f dt x f 1 dt 2 dt k Hfa) - Hfa), 

to t o ifl 

(2.29) W n (t, t 0 ) = (-*•)» f‘ dty f 1 dt t f *"" 1 dt n - H y {t n ) V(t„ , to), 

J t, J t 0 •'to 
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whose precise meanings are established in the same manner as in the 
case of W x {ty 1 0 ). 

It has become customary to introduce a chronological ordering 
operation T by setting 


(2.30) 


T[^i) ffiW) - Hfa)] = H^) H^) Hit ik \ 


> t h > 




This operation orders the time-dependent operators in the product 
H x {t^) ••• H&b) in a chronological sequence of increasing values of the 
time variable from right to left, so that the integrand in (2.28) becomes 
symmetric under permutations of t x t n when T is set in front of it. 
Since there are n\ such permutations, it is easy to check that 

(2.31) V k (t, t 0 ) = f* dt, f* dt 2 ••• f* dtJiHAtJ Hfa) - //,(4)]. 

n i J t 0 J t 0 J to 

Suppose that ^ 

(2.32) V k f = w-lim (w-lim dt,- dtJiH^) - H,(4)]/) 

= 4i f_ ^ - J. dt ^ H ^ - H &*)v 

exists for every k — 1 , 2 ,..., and that 

(2.33) W n f = w-lim(w-lim W„(t, t„)f) 

t-*+co t 0 ->— 00 

also exists and converges (at least in a weak sense) to the zero vector 
when n->+oo. Under these assumptions we obtain from (2.19), 
(2.20), (2.30), and (2.31) the following expansion: 

Sf=E M _(l + V x +Vi + -)E M+ f. 

The above expansion is known as a perturbation expansion of the 
S' operator. In the zero order for the case M ± = M 0 , it becomes 
S = Eh o = £ , Mq , i.e., it describes the case when there is no scatter¬ 
ing. The computation to higher orders is facilitated in practice by an 
explicit knowledge of the commutator H Y {t 2 )] = D(t x , £ 2 ). 

This makes possible the computation of the integrand of V k by successive 
use of the relation 


TtffrfO ffiW)] = “ h) D(t t , f a ). 
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Computations of this nature occur very frequently in quantum field 
theory, and techniques have been developed for coping with them in 
a systematic manner. 

2.6. Criteria for Existence of Strong Asymptotic States 

There are cases even in potential scattering in which M ± = {0}, i.e., 
when there are no asymptotic states in the sense of (1.7). One notable 
example is the case of Coulomb scattering, when we are dealing with the 
Schroedinger operator H with a Coulomb potential. Namely, it can be 
shown that quantum Coulomb scattering shares with the classical 
Coulomb scattering the feature of not having asymptotic states in the 
conventional sense (1.7), though asymptotic states can still be introduced 
in a somewhat different sense of the word.* 

This particular example indicates that it is desirable to produce some 
criteria for the existence of strong asymptotic states. The rest of this 
section deals with the derivation of Theorem 2.8, which will be found 
very useful in establishing the existence of strong asymptotic states 
in potential scattering (see Exercise 2.8 and §7.1). 

Theorem 2*8* Suppose 3f 1 , C Sf H n , is a linear sub¬ 
manifold of which is such that e~ iH ° l fs Q) H n Sd H for all fe 2$ x . 
Then the closure of belongs to the initial domains M ± of the wave 
operators Q ± (Moller, i.e. strong, operators) if and only if when t x —► 
— oo and t 2 —> + oo the respective strong limits of 

(2.34) , t 2 )f = \ H e-^f dt, H x = H - H 0 , 

exist for every / e 3j 1 . A sufficient condition for the existence of these 
limits is that for some t x , t 2 e IR 1 

c h r +°° 

|| H x exp(—iH 0 t)f\\ dt < +oo, || H x exp(—iH 0 t)f\\ dt < +oo, 

J -00 J t x 

respectively. 

To understand the above theorem, we must comprehend the meaning 
of the integral appearing in (2.34) (see also §3.7). 

This integral is defined by the requirement that for fixed / e 

(2.35) c/> f ( g ; , * 2 ) = f <exp (iHt) H x exp (~iH 0 t)f \ g> dt 

J h 


# For details, see §4.9. 
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determines a continuous linear functional in g. In fact, if the integral 
in (2.35) exists, then, by Riesz’ theorem (Chapter III, Theorem 2.3), 
there is a unique vector f x = , £ 2 )/such that </ x | g ) = , £ 2 ). 

Thus, in order to know that , * 2 ) is well defined we only have to 
establish that the Riemann integral in (2.35) really exists. For this task 
we need Lemma 2.3. 

Lemma 23* For any two vectors / e and g e &J H the function 

(2.36) 

F(t) = <exp(-*77 0 J)/| Hexp(-iHt)g} — <H 0 exp(—iH 0 t)f \ exp(-iHt)g} 
is continuous in t e R 1 , and 

(2.37) 

«exp(— iH 0 t)f | H exp(— iHt) g) — <H 0 exp (—iH 0 t)f | exp(— iHt) g}) dt 

Proof. To establish the continuity of the first term in (2.36), we note 
that for any r e R 1 

(2.38) 

l<exp[—1# 0 (* + r)]f\Hexp[-iH(t + r)]g} - ^xp(-iH 0 t)f\ H exp(-iHt)g}\ 

< |< ex P+ t)]/ | H(exp[—iH(t + r)] — exp(— iHt))g>\ 

+ K(exp[— tff 0 (* + T )] - ex p(— iH 0 t))f\ H exp(-iHt) g}\ 

< ll/ll II H(exp[-iH(t + r)] - exp(-iHt)) g || 

+ ||(exp [~iH 0 (t + r)] - exp(-iff o 0)/|| II H exp(-iHt) g ||. 

We recall now that by Theorem 2.7 of Chapter IV, the operator 
tf(exp[— iH{t " I - t)J exp( iHt) is a function of the self-adjoint 
operator H , and consequently 

H(exp[—iH(t + t)] — exp (—iHt)) g = (exp [—iH(t + r)] — exp (—iHt)) Hg. 

Since exp(— iHt) and exp(— iH 0 t) are strongly continuous operator¬ 
valued functions in t (see Chapter IV, §6.3), the expression on the right- 
hand side of (2.38) converges to zero when t —► 0. 

The continuity in t of the second term in (2.36) is even easier to prove 
by following the same line of reasoning, since we have 

<H 0 exp[—iH 0 (t + r)]/1 exp[—iH(t + r)] g} - <H 0 exp (-iH 0 t)f \ exp(-iHt)g}\ 

< II H 0 (exp[-iH 0 (t + r)] - exp (-iH 0 t))f || || g || 

+ || H 0 cxp(-iH 0 t)f\\ ||(exp[-iff(* + r)] - exp(-iHt)) g ||. 
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The continuity of F(t) in the variable t implies the existence of the 
integral in (2.37) for any given t ly t 2 s U 1 . Thus, the first part of the 
lemma is established. 

To establish the second part, we recall from Theorem 3.1 of Chapter IV 
that whenever g e 

s-lim - (exp[— iA(t + r)] — exp(— iAt)) g = A exp (—iAt)g 

for any A = A* in general, and consequently for A = H and A = H 0 
in particular. In view of this fact, a glimpse at (2.36) suffices to show that 

F(t) = i(cHdt)(e- iH ^f | e~ im gy. 

Hence, (2.37) immediately follows. Q.E.D. 

Let us return now to the proof of Theorem 2.8. When 

f e^ 1 C @) R n , 

then exp(— iH 0 t)fe , so that 

<exp| H exp(— iHt) g) = <exp(*7ft) H exp (—iH Q t)f \ g} 9 

and F(t) can be written in the form 

F(t) = (exp(iHt) H x exp {—iH 0 t)f \ g). 

Thus, according to (2.37), we have 

(2.39) <[fi(/ a ) - \g) = i <exp (iHt) H x exp (~iH 0 t)f \ g > dt 

J h 

for any / e 3t 1 and g e . Since 2 K is dense in the integral on the 
right-hand side of the above relation can be said to determine a unique 
linear functional in g e Thus, we can write symbolically 

(2.40) [<Q(J 2 ) — = — i\ exp(iHt) H x exp 

J h 

It follows from the above relation that in the case s-lim Q(t) f exists 
for £ —► — 00 or t —► -f- 00, the respective strong limits of the integral 
in (2.34) for t x —► — 00 or t 2 —► + 00 also exist. 

Conversely, let us assume that 

J2 

s-lim exp(iHt) H x exp(—iH 0 t)f dt 

*l-*-°o J 
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exist for all fe . Then, it follows from (2.40) that s-lim also 

exists when t x —► — oo and / e . However, by Lemma 1.1, the set of all 
vectors / for which this last limit exists is closed, and consequently the 
closure of 2 1 must be contained in it. Thus, the theorem is 
established. 

For all practical purposes, the investigation of the existence of the 
strong limits in t x and t 2 of the integral in (2.34) reduces to the inves¬ 
tigation of the existence of integrals 

r +CO 

(2.41) I II exp(—ti/ 0 J)/|| dt 


for some t x e M 1 , and of 

(2.42) II H x exp(— iH 0 t)f\\ dt 

for some t 2 e R 1 . In fact, by using the Schwarz-Cauchy inequality we get 

(2.43) 

■ ^2 

<exp(t/ft) H 1 exp (—iH 0 t)f | g) dt 

- <i ) 

Hence, the existence of (2.42) implies, in conjunction with (2.39), that 
u = w-lim fi(^) / exists when t x —► -CO. Since || Q(h)f\\ < ||/|| = 
||/ + ||, we infer by using Lemma 6.2 of Chapter IV that s-lim Q{t^)f 
also exists for t 1 —> — oo. 

The argument for t 2 —► + oo is completely analogous. 

2.7. The Physical Asymptotic Condition 

The condition which the Schroedinger-picture strong asymptotic 
states x F ex (t) (ex = in, out) of the interacting state W(t) have to satisfy, 

(2.44) lim || W{t) - V* x (t)\\ = 0, 

f-»±oo 

has no direct physical meaning, since the above vector norm is not a 
measurable quantity. We recall from §§1 and 2 of Chapter IV that the 
directly measurable quantities are the probabilities 


< II g II f II exp (iHt) H x cxp(-iH 0 t)f\\ dt. 


(2.45) P$; t ]- An ( B ) = <n*) I E Al 


An (B) w(t)y. Be a*, iiy(0ll = i, 
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for any set of compatible fundamental* observables. Hence, we expect 
asymptotic states to satisfy some kind of physical asymptotic condition 
which would specify that the probabilities (2.45) for the interacting 
state W{t) approximate arbitrarily closely the corresponding probabilities 
for the respective asymptotic states } I fill ’ oxlt (t), when oo. That 

this is actually the case is stated in Theorem 2.9. 

^Theorem 2*9* Let the Hamiltonian H and a set O 0 of additional 
fundamental observables be such that any other observables of the 
interacting system or the free system is a function of compatible 
observables from = {H} U 0 0 or 0£ x = {f/ 0 } U 0 Q , respectively. 
Furthermore, assume that whenever { H , A /,..., A n '}, A/,..., A n ' e 0 Q , 
is a set of compatible observables, then H 0 , A /,..., A n ' are also com¬ 
patible. In that case the strong asymptotic states ¥ /ex (^) (ex = in, 
out) of the scattering state ¥*(£) satisfy the physical asymptotic conditions 


(2.46) 


I® I | = 0, 


(2.47) ljm | P H ^;' . K {B) - P%% .^ (B)\ = 0, 


for any sets {A x A m } C 0 O , {H, A x ,..., A n '} C 
observables. 


fief”, 

B e & n +\ 

Of of compatible 


Proof. It is straightforward (see Exercise 1.7) to prove that (2.46) 
holds, i.e., 

lim | <W(t) | E Al . a ™(B) W(t)} - <Yex(t) | E Al . Am (B) ^(^))| = 0 


when (2.44) is satisfied. 

Since Af,..., A n ' commute with H as well as with H 0 , we infer that 
for arbitrary B 2 e 


e im e-iH^ E A i . An (B 2 ) = E Al ' . A ” (B 2 ) e iHt e~ iHot . 

By applying the operators on both sides of the above equation to vectors 
f e M ± and letting t —► ^ oo, we obtain immediately E Al ’-- ’ An '(B 2 )f e M ± 
and 

(2.48) £2 ± E a ' . ^\B t )f = E A ' . A *(B 2 ) Q ± f. 


# See the discussion in the beginning of §2 in Chapter IV concerning fundamental 
observables. 
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Furthermore, ii g _L M ± , then 

<e a ' . An \B 2 )g l/> = <g I E*' . A ”(B 2 )fy = 0 

for all /e M ± , and therefore E Al ' . (B 2 )g _L M ± . Consequently, 

(2.48) holds for arbitrary /eJf\ Combining this relation with (2.4), 
we conclude that 

(2.49) Q±E h °(B 1 ) E a i a ”(B 2 ) = E^BJ E Al a "(B 2 ) Q± 

for any B x e SS X and B 2 e 

As a, consequence of the basic properties of spectral measures (see 
Chapter III, the proof of Theorem 5.5), we can write for any B e ^ n+1 

E Hq,Ai ' . A «(B) = inf E H \B[ k) ) E Al . A ”\B ik) ): BC\j B ik) x B ik) |, 

where the projector infimum is taken only over finite unions of sets 
B[ k) x B {k \ with B[ k) e^\ B {k) e^ n . Hence, E h «’ a i>-' a «(B) is the 
strong limit of some sequence of finite sums 

£ E H °(B ik) ) E Al . An \B ik) ) 

k 

corresponding to a sequence of sets [j^-(B\ k) x B (k) ), with B as an inter¬ 
section. This result in conjunction with (2.49) implies that 

(2.50) Q ± E h °- a ' . An (B) = E H - Al ' . a ” (B) Q ± 


for any B e ^ n+1 . 

We recall that 

(2.51) W(0) = fl±y«( 0), 

where ex = in for Q + and ex = out for Q_ . Hence, after noting that 


[e~ iHt , E HMl ' .^"'(B)] = E H »- Al ' 


An 'm = 0 , 


by using (2.50) we easily obtain 

(2.52) . An {B) = (e~ im W(0) | E H,Al . <(B) e~ iHt W{0)} 

= <Q ± V<*( 0 ) | E H,Al . An (B) Q ± V<*( 0 )> 

= <Q ± W**( 0 ) | Q ± E H «- Al ' . An '(B) 
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where the last step follows from the fact that i3 ± *,(3 ± y /ex (0) = ^^(O), 
since ^^(O) e M ± . Hence, (2.47) is established. Q.E.D. 

All nonrelativistic quantum mechanical theories satisfy the conditions 
of the above theorem. For example, in wave mechanics for a single 
spinless particle interacting with a potential V(r), one can take 

H 2'-(l/2m)A + V(r) 

and G 0 consisting of L 2 , L„(n e IR 3 , | n | = 1), Q k and P k {k = 1,2, 3); here, 
L 2 denotes the total angular momentum operator (encountered in §7 of 
Chapter IV), L n = n • L is the angular momentum projection in the 
direction n, and Q k , P k are the position and momentum components 
with respect to the &th axis of a Cartesian inertial reference frame. 

From the physical point of view, it is sufficient if asymptotic states 
satisfy the physical asymptotic conditions (2.46) and (2.47), rather than 
(2.44). As a matter of fact, albeit that for long-range potentials (such as 
the Coulomb potential or V(r) ~r _0£ , 0 < oc < 1) there are no free 
states satisfying (2.44), there are free states satisfying (2.46) and (2.47). 
One could call such states physical asymptotic states , and develop* 
a scattering theory based on the existence of these states. 

As a matter of fact, in this case one could define the wave operator Q+ 
to be the operator which maps an incoming physical asymptotic state 
into the corresponding interacting state, i.e., as the operator satisfying 
(2.51); could be defined in a similar manner. As soon as the wave 
operators are defined, one can proceed in the usual manner and introduce 
the scattering operator by means of (2.2) and the transition operator 
by the old formula 

(2.53) T = ( 1 - S)12ttu 

For the rest of the quantities of importance in scattering theory (transition 
probabilities, differential scattering cross section, etc.), one can also 
retain the old definitions (see §4.9). 

Exercises 

2 A* Assume that C is a bounded operator in and A x , A 2 are 
self-adjoint operators in with spectral measures E A i(B) and E A *(B), 
respectively. Prove that the relation e iA ^C = Ce iA 2 * is satisfied for all 
/ e R l if and only if E A ^(B)C = CE A *(B) for all B e 


# For details see the articles by Prugove£ki [1971a, 1978b], which also contain further 
references. 
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2*2* Show that if U is a partially isometric operator with initial 
domain M, then (JJf | Ug > = </ | g} for all /, g e M. 

2*3* Show that if E is a projector and || Ef\\ = ||/|| for some/e 
then / = Ef. 

2A* Show that/ 0 is a cyclic vector with respect to A if and only if 
E*(S*) f 0 is a cyclic vector of A c and E A (S A ) f 0 is a cyclic vector of A p . 

2*5* Prove that if a self-adjoint operator A leaves the closed linear 
subspace M invariant, then [E A (B), E M ] = 0 for all B e & 1 . 

2*6* Show that if A is self-adjoint and fe 2 A , then e iAt f e for 
all t e U 1 . 

2*7* Suppose that the potential V(r) is Lebesgue square integrable 
on R 3 so that || V\\ < + oo. Apply the result of Exercise 1.9 to prove 
that if i/j( r) is Lebesgue integrable as well as square integrable on R 3 , then 


0)(r)| < 


m 

2irt 


3/2 

110 Hi. 


ii 0 iii = f \m\d*, 

J 05 3 


almost everywhere in R 3 , and consequently 


II ^"*‘011 < 


m 


3/2 


2irt 


v \\ ||0 ||i, 


where (Ve~ iHot ifj)( r) = V(r)(e~ iHot if/)(r). 

2*8* If the potential F(r) is Lebesgue square integrable on R 3 , then 
M + - M_ = L 2 (R 3 ) in potential scattering for V(r). Explain how this 
statement follows from Theorem 2.8 and Exercise 2.7. 


3* General Time-Independent 
Two-Body Scattering Theory 

3.1. The Relation of the Time-Independent 
to the Time-Dependent Approach 

It is possible to construct the Moller wave operators Q ± by taking 
limits with respect to parameters other than the time parameter t. 
Scattering theory based on such time-independent procedures for 
computing Q ± , S , and T is called time independent or stationary . 

The relation of the time-dependent to the time-independent 
approach is best exhibited by showing that Q ± can be obtained as strong 
limits of operator functions £2 ±e for e —► +0, where the parameter c, 
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0 < e < -f- oo, is not related to the time parameter t and has no physical 
meaning, and the operators Q± e satisfy the equations 


(3.1) 


<£ I = « f e-‘Xg I dt, 

J o 

<g I Q*J> = ' f e*Kg\Q(t)f}dt 

j —00 


for all/, g e 34? and 0 < e < + oo. 

Let us show that the relations (3.1) can play the role of definitions of 
linear operators Q ±e . 

Since e iHt and e~ tHoi are strongly continuous functions of t (see §6 
of Chapter IV), Q(t) = e iHi e~ lHot is also strongly continuous (see 
Chapter III, Exercise 5.6). Hence, for any f,g£ 34?, the function 


<g I m/> = <g I e iHt t £ R1, 


is continuous, as well as bounded in t : 


K# I e im «-"•*/>! < Ik IIII e im e-^fW = ||/|| ||* ||. 

Consequently, the first improper Riemann integral in (3.1), 

(3.2) £ f e-<\g | e im e^f) dt = (g\f), 

J 0 

converges absolutely for any/, g e 34?: 

(3.3) c f |<£ | e im dt ^ ||/|| ||; || e f e* dt = ||/|| || g ||. 

J 0 J 0 

Thus, the functional (g | /) is defined on 34? x 3t?. 

It is straightforward to verify that (g I /), /, g e 3^, is a bilinear form. 
Furthermore, we see from (3.3) that (• | •) is bounded. Hence, there is 
a unique bounded linear operator (see Chapter III, Exercise 2.5), 
which we denote by Q _ € , such that 

(3.4) <£ | a_jy = (g I/) = e f” e-%g I e im e^f ) dt 

J 0 

for all/, g e 34?, We write the above relation in the symbolic form 
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under the agreement that its real meaning is given by (3.4). In fact, 

(3.5) is an example of a Bochner integral (see Theorem 3.4 in Appen¬ 
dix 3.7 to this section). 

It can be shown in a completely analogous manner that for any e > 0 
there is a unique bounded linear operator Q +€ which satisfies 

(3.6) <£ I Qjy = e f° e^g\e im e- iH ^f>dt 

j —00 

for all /, g e M . The above relation is written in the symbolic form 

Q € = e f° e et e iHt e~ iH<lt dt, e > 0. 

j —00 

We are interested in the existence of the strong limits of Q ±€ f as 
e —> +0. 

From (3.3) we immediately infer that 

(3.7) 110*11 <1. 

By applying Lemma 1.1 we deduce that the sets N + and N_ of all 
vectors for which the respective limits of Q ±e f exist, 

(3.8) H ± = {f:s-YimQ ± JeJf} 

are closed linear subspaces of Jf. We shall prove now that M + C N + 
and M_ C N_ (the converse not being generally true!). 

Theorem 3*1 ♦ If for some /e^f, s-lim^ +00 Q(t)f exists, then 
s-lim e _> +0 Q_ e f also exists, and 

(3.9) s-lim Q{t)f = s-lim Q_J. 


Similarly, if for some g e Jf?, s-lim^oo Q(t) g exists, then s-lim e _> +0 Q +e g 
also exists, and 

(3.10) s-lim Q{t)g = s-lim Q+eg. 

Proof . Let us write 


/- = s-Ji 1 ? Q (f)f 
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when the above limit exists. Then, for any g e 34? 

I<£ I —/->l 

= I « f" e~ et <g I mf> dt-*f e-‘Kg I /_> dt 

' J 0 J 0 

< € f er« Kg I Q(t)f -/_>I A < e II; II f II fl(0/ - /- II dt 

J 0 J 0 

“Hi-11 r«-Ni ■»(«/*)/-/-ii <*«• 

J 0 

We set out to prove that the above integral is smaller than any given 
rj > 0 for all sufficiently small e > 0. 

Choose some a > 0 and split the interval of integration: 

/.oo /.a /»oo 

J =J +J • 

J 0 J 0 J a 

For the first integral we obtain the following upper bound: 

f II Q ( u h)f -/- II du < f V“(|| Q(u/ e )f\\ + ||/_ ||) du < <*(||/|| + ||/_ ||). 

J 0 *'0 

Now, take some a satisfying 

0<< ‘ < S l|/|i ' +||/_|| • 

and then choose e > 0 so small that 

(3.11) \\mf-f-\\<vi2 

for all t ^ a/e. For any such e, we compute that 

f e-\\Q(ule)f-f_\\du 
J o 

< \ y F| j"j ||y_ || (H/ll + ||/_ II) + J e~ u fa du^. fa + fa J e~ u du = -q. 

Consequently, for any given rj > 0 we have 

for all e > 0 which are such that (3.11) is true whenever t ^ a/e. 
Hence, we have established that 
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tends to zero, and for any choice of \' (k) e [A (fc “ 1) , A (fc) ], k = 1,..., w, 
the limit of Riemann-Stieltjes sums 

(3.17) lim £ F (- or(A ( *" 1) )) 

6 ~*° /c=l 

exists, and is independent of the selected sequence of subdivisions; if 
the integral (3.17) exists, then its value is taken to be 

(3.18) f F( A) d A o-(A) = lim £ F(A'<*>)[a(A<*>) - o(A < * -1> )]. 

If F(X) and <r(A) are defined on R 1 , then the improper Riemann-Stieltjes 
integral on R 1 is defined as 

(3.19) f J(\) d A a(X) = Hm J>(A) d A a{ A), 

a -*—oo 

provided that the two limits in a and b exist. For functions F(A, A 0 ) and 
<r(A, A 0 ) of two variables A, A 0 e R 1 , we define the cross-iterated Riemann- 
Stieltjes integral on [a, b ] X /, where I is a nondegenerate interval in R 1 , 
by requiring that for any of the above sequences of subdivisions of \a> b], 

(3.20) C d x f F(A,Ao)^o(A,Ao) 

J a J I 

= ljm £ J F( A'<*>, A,) rf,.[a(A<», A 0 ) - a(A»-«, A,)], 

where the above limit should be finite and of a value which is independent 
of the chosen sequence of subdivisions. 

It is easy to relate Riemann-Stieltjes integrals to integrals with respect 
to measures when F(A) and <r(A) are Borel measurable functions (see 
Exercise 3.3). It is equally easy to prove in a direct manner that most of 
the usual properties of Riemann integrals (which are obviously special 
cases of Riemann-Stieltjes integrals in which <r(A) = A) are also properties 
of Riemann-Stieltjes integrals (see Exercises 3.4-3.8). 

We find the concept of Riemann-Stieltjes integration of importance 
because it enables us to give a meaning to the integral in (3.16), namely, 
the integral in (3*16) can be written as a cross-iterated Riemann-Stieltjes 
integral with respect to the function (see also Definition 3.4) 

"(AA)=r<iM E* dt. 

J 0 


(3.21) 
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Let us introduce the following convenient notation: 

(3.22) or((A, A'] x (Ao , Ao']) = o(A', A 0 ') - <x(A\ Ao) - o(A, A,/) + o(A, Ao). 

We can establish that (3.16) is true by means of Lemma 3.1. 

*Lemma 3*1 ♦ Suppose the real function ct(A, A 0 ) is of bounded 
variation on \a> b ] X [c, d \, i.e., that for any A e [a, b ] and A 0 e [c , d] we 
have 


( 3 . 23 ) v(X, A,,) = sup X I "((A* 1 - 1 *, A (<) ] X (A*^, A<*>]) 


- 00 , 


where the supremum is taken over all subdivisions 

(3.24) a = A <0> < A (1) < - < A (m) = A, c = A* 0) < A^> < - < A^ = A 0 

of [a. A] X [c, A 0 ]. If the complex function F{ A, A 0 ) is continuous on 
[a,b] X [Cy d\ y then both its cross-iterated Riemann-Stieltjes integrals 
with respect to ct(A, A 0 ) exist and 

(3.25) f d x fV( A, Ao) d Xo o(\, A 0 ) = f d Xo f F( A, Ao) d x a{ A, Ao). 

J a J c J c J a 

Proof. It is easy to see that the functions 

^(A, ^o) = M^A? + a ( a ? c ) =t a (K Ao)] 

are nondecreasing in [a 9 b] X [c y d] in the sense that ct ± ((A, A']x( A 0 , A 0 '])^ 
whenever A ^ A' and A 0 ^ A 0 '. Since 

<KA, Ao) = <r + (A, A 0 ) — <j_(A, A 0 ), 

we can exploit the linearity properties (see Exercises 6.4 and 6.7) of 
cross-iterated integrals to reduce the integrals in (3.25) to a sum of 
integrals of real functions with respect to ct + (A, A 0 ) and ct_(A, A 0 ). Hence, 
it is sufficient to prove the lemma for the case of a nondecreasing ct(A, A 0 ) 
of bounded variation and for real F(X f A 0 ). 

By using the mean-value theorem for Riemann-Stieltjes integrals 
(see Exercise 3.8), we obtain 


f 0 F( A' ( « A 0 ) d Xo [o(A<« A 0 ) - a(A«-« Ao)] 

J x (k~D 
0 

= F( A' (<) , \' a (k) ) a((A (< - l) , A (<) ] X (A'* ; - l) , A< ft) ]) 
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Since ||/_|| = lim^ +00 1| Q{t)f\\ = ||/|| ^ || £L e /||, we have shown that 
the weak limit (3.12) is also a strong limit (see Chapter IV, Lemma 6.2), 

/- = s-lim £?/, 


i.e., (3.9) is true. 

The proof of (3.10) proceeds along identical lines. Q.E.D. 

As an immediate consequence of the above theorem we obtain the 
important result 

(3.13) Q± = s-lim Q ±e £ M± , 

which can be considered as a time-independent alternative definition 
of the wave operators. It should be noted, however, that Theorem 3.1 
establishes only that M ± C N ± , and not that M ± = N ± . Since the con- 
converse of Theorem 3.1 [which would state that the existence of 
s-lim e _* +(y Q ±e / implies the existence of s-lim t ^ ±O0 Q(t)f] is not generally 
true (see Howland [1967]), it has been suggested (see Jordan [1962a, 
Section 6]) that Q ± could be defined in a time-independent manner, 
by taking limits of £? ±e /, even in cases when the time-dependent de¬ 
finition (2.1) is not adequate because M ± = {0}. 

In potential scattering, for classes of potentials which have been con¬ 
sidered in the last section, the above problem does not arise, since in 
those cases M ± = and consequently 


Q± = s-lim Q(t) — s-lim Q ± e . 

f-*Too v ' e-*+0 e 

From (3.13) we get for any 

<V±*f\g> = 

Since Q ± * are partially isometric with initial domains K ± , we easily 
verify that 

= ]im<E„QZE R J\gy 

by noting that for/eR ± the above relation is equivalent to the relation 
preceding it, while for / J_ R ± , the expressions on both sides of the 
above relation vanish. In view of the fact that 


II E M± QtE R J || < || E r J || = || Q ± *E r J II, 
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we infer from this result (by using Lemma 6.2 of Chapter IV) that 
(3.14) Q±* = s-lim E^ ± Q%E R± . 


3.2. Lippmann-Schwinger Equations in Hilbert Space 


One of the advantages of the time-independent limits (3.13) for deter¬ 
mining Q ± is that they lead to integral equations which can be used to 
compute Q ± without having to compute Q(t). 

Using the spectral decomposition H = J A dE x and recalling that 
e iHt is a function of H , by employing Theorem 2.1 of Chapter IV we 
obtain 

(3.15) <* | £>_/> = 6 f e~Ag | e im } dt 

J 0 


= « f dt*-* f e M d,(g \ E? e- iJI °T } 
J o •'r 


Now, we would like to interchange in the above relation the order of 
the integrations in A and t , and write 


-+00 -oo 

(3.16) (g I £?_/> = e f dA exp[— (e - a)*]<* I E“exp(-iH 0 t)f> dt. 

J —co J 0 

However, before we ask whether such an interchange of the order of 
integration is correct, we must make clear the meaning of the integration 
in A occurring in (3.16). We will show that the interchange of the order 
of integration can be easily justified if the integral in A is taken to be 
a Riemann-Stieltjes integral* 

Definition 3*1 ♦ The Riemann-Stieltjes integral 
[ b F{X)d x a{X\ a < b, 

J a 

of the complex function F(A) with respect to the complex function 
<r(A), a ^ A ^ b y is said to exist if for any sequence of subdivisions 
a = A (0) < A (1) < ••• < A (n) = ft of [<z, ft] which is such that the 
norm 8 = max fca3llj< <>n (A (fc) — A (fc_1) ) of the subdivisions in the sequence 


# Measure theory (i.e., Fubini’s theorem) cannot be applied since in the resulting 
formula (3.30) the Riemann-Stieltjes integral cannot be interpreted as an integral with 
respect to a measure on (R a , & 2 ). Indeed, if such a measure existed, then, by the Riemann- 
Lebcsgue lemma we would obtain that <g | Q ±/> s= 0 if S ^ and S^o are absolutely 
continuous. 
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for some Ao fc_1) ^ Ao (fe) ^ X ( 0 k) . Combining this result with the defining 
relation (3.20), we get 

( d x fF(X,X 0 )d Xo a(X,X 0 ) 

J a J c 

m n 

= lim £ £ F(X' H \ Af>) <r((A <<_l) , A (i) ] X (A^ 1 *, A< fc) ]) 

i= 1 A:=l 

in the limit of finer and finer subdivisions (3.24) of [a , b\ X [c, d\. The 
existence of this limit and its independence of the chosen sequence of 
subdivisions {(A (i) , A ( 0 fc) )} is obviously a necessary and sufficient condition 
for the existence of the integral. To establish the existence of this limit 
and its independence on the sequence, it is sufficient to note that if 
{(A^, A^ ,l) )} is a subdivision of [ a , b] X [c, d\ , which contains and 
is finer than the subdivision {(A (i) , A£ fc) )}, and if X' Q ik ’ l) ) are points 

within the meshes determined by {(X {i * j \ X ik ’ l) )}, then due to the uniform 
continuity of F( A, A 0 ) we can achieve for any given € > 0 

|F(A' (U) , X' 0 ik > l) ) ~F(X ' U) ; X' 0 ik) )\ < € 

for all sufficiently fine subdivisions {(A (i) , A ( ( f ) )}. Consequently, if X {i > j) 
and X { 0 k,l) denote the points preceding X H > j) and X ik,l \ respectively, in 
the subdivision, then 

I £ £ K ik,l) ) A (u) ] x (AJ*- 0 , x ik - l) ]) 

' i,j k,l 

-XE^’(A' <l) ,Af ) )a((A (i - 1) A (<) ] x (A^.A®]) 

i k 

<e££"((A (i ' 3) ,A (w) ] X 

ij k,l 

= e<j((a, b] X (c, </]). 

This establishes the existence of the considered limit, as well as its 
independence of the sequence of subdivisions. 

By the same token 

f d Xo fF(X,X 0 )d x *(X,X 0 ) 

J c J a 

m n 

= lim £ £ F{X" H) , A" w ) <r((A <i_l) , X H) ] x (Aj*" 1 ’, A<*>]) 

i= 1 fc— 1 
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also exists. By exploiting again the uniform continuity of F( A, A 0 ) in 
[a,b] X [c , d], we conclude that for any e > 0 

| AZ (k) ) — F(\ /(i) , X' 0 ik) )\ < e 

for all sufficiently fine subdivisions (3.24) of [a, b ] X [c, d\. Hence, 
the absolute value of the difference of the two integrals in (3.25) is smaller 
than 


m n 


EE 


| F(X" (i) , Xg k> - F(X' (<> , X^ </C> )l I CT((A <i - 1> , A <!> ] 


X (Ao ft-1> , Aj*>])| 


i=1 k =1 

= €a((a, b ] X (c, d ]). 

Since this is true for arbitrarily small values of € > 0, we conclude 
that (3.25) holds. Q.E.D. 

Let us compute the total variation function ^(A, A 0 ), defined by (3.23), 
for the function o(A, A 0 ) given in (3.21). By using in the process a decom¬ 
position of the type (5.13) in Chapter III, we easily derive 

(3.26) 

m n 

v(\, A„) < sup £ X f ° |<£ ff ((A ( * -l) , A (i) ] )g I e-^f'A dt 

M k =1 J 

o 

./\ 0 m 

= sup f £ Ke-^f | E h ((X u ^, A w ])^>| dt 

J C i =1 

< i ({<•-"**/ + g I E H ((a, XW***/ + *)> 

J C 

+ <€**** f + ig | E H ((a, A ])(e~ iH ^f + ig)} 

+ VXe-^f | E H ((a 9 A]) e~ iH ^/} + VKg \ E H ((a , X])g}} dt. 

Since the integrand in the above integral is continuous and therefore 
integrable, it follows that ct(A, A 0 ) is of bounded variation. Consequently, 
Lemma 3.1 can be applied to infer that 

fdt f* d^g | E«e- iH ^f) = f d K fe-^ <g \ E? dt. 

J 0 J a J a J 0 
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To establish the validity of (3.16), we have to prove that the above 
relation remains true in the limit when a — oo and 6, r — > + oo. 

In view of the relations (see Exercise 3.5) 


(3.27) 


+ oo r +oo r r r b r +°° r +oo rT r a r r r +oo 

J J -J J =J J + f f +J f > 

J 0 J —oo J 0 J a J t " —oo J o J —oo ^ 0 J b 

r +co r +co r r r +°° /*+°° /»a r r -+co - T 

J f -f J =J f + f J +/ / - 

J _oo J o J a J 0 J —oo J r J —oo o J b J 0 


and (3.26), we see that this will be the case if and only if the integrals on 
the right-hand side of the equalities (3.27) exist and vanish when 
a —► — oo and 6, r — ► + oo. 

Employing the procedure used in deriving (3.26) we arrive at the 
following estimate: 


C dUT* f”° e iM d x (g | £?«-"•*/> 

J c Q J a 0 


< J“V‘ /*>*<*& 

J C 0 J «o 


?0< /> 


dt 


< * f* + £ | E^(K , b 0 ])(e- iH ^f + *)> 

J *o 


+ <€*** + ig | E H ((a 0 , b 0 ])(e- iH ^f + ig)} 

+ V2(e-^f | E H ((a 0 , ftj) e~ iH ^f} + V2Q | E H ((a 0 , b 0 ])g}} dt. 

By Lemma 3.1 in Chapter IV, the integral on the right-hand side of the 
above inequality can be made arbitrarily small, e.g., for sufficiently 
large values of either c 0 (for arbitrary values of d 0 > c 0 ), or of a 0 (for 
arbitrary values of b 0 > a 0 ). Since, according to Lemma 3.1, the order 
of integration in A and t can be reversed without changing the value 
of this integral, we conclude that all integrals on the right-hand sides 
of the relations (3.27) vanish in the limit when a —► — oo or 6, r —^ + oo; 
for example, for the representative case of the first of these integrals, 
we can choose for given rj > 0 a t^) and a sequence r(rj) = r 0 <Ti < 
t 2 < ••• diverging to infinity which is such that 


and consequently 



< V k y 





Z v k = 


17->0 


0 . 
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Thus* we conclude that (3.26) stays valid in the limit a —> —oo and 
6, r -> +oo, and therefore (3.16) is true. 

Using the fact that e~ iHQt is a function of H 0 , and applying Lemma 3.1 
in the same manner as before to interchange orders of integration, we 
arrive at the result 


f exp[— (c — *A'X|<£ | Ef exp[— iH 0 t]fy dt 
J 0 

*00 *+00 

= dt exp[—(e - *A')<] exp[-«A 0 <] d^Efg \ E^f) 

0 J —00 
*+°0 * 0 ° 

= f d Xo exp[—(e - iX + i\ 0 )t]<E?g I <»/> dt 

J —co J 0 

*+00 1 

-L «-«•+, i 

Inserting this result in (3.16), w.e obtain 


a+OO /• +oo - 

(3.28) <* | Q_,/> = J_^, I 

In a completely analogous manner we can derive a corresponding 
relation for Q +€ : 


*+oo *+00 _** 

(3.29) <£ | Q + jy = + <4 + A _ + ^ d^g | «»/>. 

The two relations (3.28) and (3.29) are equivalent to the single relation 


(3.30) <£ I Qjy = f °° d A f” • + d^Efg I <»/>, 37 + 0, 

J _oo J _oo A *7 

if the parameter e > 0 is replaced by the new parameter rj + 0, which 
can assume also negative values. 

Using the defining formula (3.20) for cross-iterated integrals, we 
obtain from (3.30) 

<&r,*g \f> = <g\ &J> 


w *+oo _ -V, H H H 

= lim lim Y f —-+-- d. <(£> - £>-d) g \ E x •/> 

6->+oo S->0 J A (fc) — A„ — in V" A A 35 I A 0 V / 

a->—oo 1 u # 

n y I • V 

= lim lim y /g £W(A<*-«, A< fc >]) - -r-/ > 

= <*'/!>'a-t 

» 

where the last integral is defined by the limit preceding it.* 


# See also Definition 3.4 for integrals of this type in Appendix 3.7 to this section. 
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According to Theorem 2.2, of Chapter IV, 

( _I=2_ \* = __ 71 = 7 ,** 0 

\A -H 0 - ir)) A - H 0 + irj 9 V V ^ 9 

and consequently we have 

(3.31) <13,., |/> - ( *Jr I/). 


Since ^(A — H 0 + ^) _1 and (A — i/ 0 )(A — + n?) -1 are bounded 

functions of H 0 , they are defined everywhere in Jff. According to 
Theorem 2.4 of Chapter IV, we can add and multiply these operator 
functions as we would ordinary functions, and therefore we have the 
identity 


(3.32) 


ir 1 

A — H 0 + ir) 


X — H 0 
A — H 0 + ir) 


Substituting the right-hand side of the above equation in (3.31), and 
using symbolic notation (see also Appendix 3.7 to this section), we 
arrive at the following expression for 


(3.33) 


a 


r + A 

} * = 1 — —- 

J-ooA — 


H n 


H 0 + ir) 


d\E\ • 


Consider a free state e~ iHot W, W e M ± , which is the incoming asymp¬ 
totic state of e~~ iHt W + or the outgoing asymptotic state of e~ iHtl P ~. Since 
obviously 

W ± = s-lim e iHt e~ iHot W = Q ± W, 

00 


we deduce from (3.14) and (2.10) 

s-lim E m .Q v *W ± = Q±*Q ± V = W. 

7}-»±0 == 

By Theorem 3.5 in Appendix 3.7 this result, combined for M ± = Jf 7 
with (3.33) leads, after a rearrangement of terms, to the equations 

r+°° 1 

( 3 . 34 ) i* - r + s;lim », d,E^, = 

The above two relations can be considered to be equations for deter¬ 
mining W~ and ¥ /+ , respectively. We shall refer to them as Lippmann- 
Schwinger equations in Hilbert space . Indeed, in the next sections we shall 
see that in the special case of potential scattering these equations 
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lead to a commonly encountered form of integral equations for distorted 
plane waves, usually called Lippmann-Schwinger equations. 


3.3. Spectral Integral Representations of Wave Operators 


The main drawback of (3.34), viewed as equations for finding W ± 
when W is given, is that they require a knowledge of the spectral function 
Ef of the total Hamiltonian H. Now, in practice, Ef is an unknown 
quantity which is very difficult to compute. In fact, if Ef were known, 
then one could easily compute the time evolution operator 
U{t y t 0 ) = exp[— iH(t — J 0 )], and thus solve the dynamics of the 
problem generally, without even having to resort to scattering theory 
techniques. 

A practically more suitable set of equations relating W and which 
yield themselves to perturbation techniques, can be derived from (3.1) 
by applying the spectral theorem first to H 0 and then to H , and otherwise 
proceeding in the same manner as in the derivation of (3.16): 


-+00 r +CO 

( 3 . 35 ) (g I Q^fy = € f dte~ H f «-"•* d^e~ im g | <«/> 

0 J —oo 

/•+ 00 -+co 

= «f <4„f e- ( ^\g\e im E^/ydt 

J —oo J o 

*+00 -+00 -+00 

= « f d Ao f dte- (e+u ° u f e m d,Q I E»E%fy 

J —oo 0 ^ —oo 


r+c° /*+°° 


1 


€ -j- iXq — iX 

Hence, we end up with the relation 


(3.36) 


<f I I 

- (Ar.-H+kTu^y 


An analogous relation can be derived for Q +e . After reintroducing the 
parameter rj ^ 0, we can write (3.36) and the corresponding relation 
for iQ +e jointly in the symbolic form 


< 3 - 37 > a -- Ch^- k~+ - e “'- 

Naturally, if we exploit the relation 

_—n?__ j _ H — A 0 


(3.38) 


H — A 0 — tr) 


H — \ — it}' 
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(3.37) can be recast as 

(3.39) Q -~ 1 
Hence, if we again write 

(3.40) ¥* = Q ± W = s-lim QJF, <FeM±, 

rj->±0 

then in view of (3.39) and of Theorem 3.5, 

/. + 00 1 

(3.41) »» = g + A( _ „ + ., H, d^Y, 

The above two equations are the solution-type Lippmann-Schwinger 
equations in Hilbert space , since they can be viewed as solutions of (3.34). 
However, they actually are not explicit solutions of the equations (3.34) 
since they contain the operator function (A 0 — H + irj ) _1 ( H — H 0 ), which 
is unknown in practice. On the other hand, we shall see later in this 
section [see (3.67)] that (3.41) can be solved by iterative procedures. 
Let us write (3.39) in the form 

(3.42) </l *,*> = </!*> + ' 

Taking adjoints on both sides, we obtain 

a i b„*/>-<? i/>+<«| /) • 

In symbolic notation the above relation assumes the form 

/.+00 TJ _ \ 

(3.43) 

which will be very useful further on. 

3.4. The Transition Amplitude 

It was mentioned in §1 of Chapter IV that if W^t) and *FJt) are 
states in the Schroedinger picture, the | (Y^t) | ^-(0) I 2 * s called the 
transition probability from x P_(t) to ¥+(*); furthermore, the complex 
number (W^t) | x f / _(t)') is called the transition ( probability) amplitude 
from WJt) to Y + (t). 
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Let us assume that !?+(*) and W_(t ) are scattering states, with asymp¬ 
totic states ^^(t) and ¥ / l x (^), respectively. Since 


!P±(0) = fl + y±(0) = Q_WT\ 0), 

we easily obtain 

<n(01 ^-(O) = <^ + (o) i y_(o)> = <n ut (o) i sv'j'i o)>. 

Let us simplify the notation by writing 

^i n (0) = ¥i , Wl u \0) = V t , 

where the indices i and f stand for “initial” and “final,” respectively. 
Using the properties of strong limits, we obtain 

(3.44) CP t I ^i> = <y f | | fl+lPi) 

= <s-limi3_ E ^ f |i3 + ^i> 

e-»+0 

- lim<fl_JP f | Q + ^i). 


Substituting the expression (3.39) for *Q_ e in <£?_ e *¥t | we get 

(3.45) 

<o~^t I fl+y,) = <y f I fl + y,> + I fl+^i). 

On the other hand, after using (3.43) to express the identity operator in 
terms of the other two operators occuring in that equation and then 
applying it to the vector Q + W i , we arrive at the relation 

(3.46) Q + V i = a*fi + V , - J d x Ef : O+^i • 

After substituting the above expression for fi + ¥ / i in the first term on the 
right-hand side of (3.45) and carrying out an obvious summation, we 
obtain 

(3.47) <Q_JP t \Q + 'Pi> 

= OF, I - (v, | f l d * o^.> 

In view of (3.44) and the fact that if M + = M_ , 

(3.48) | 0^3+'*' i> = <!P f | £ M+ ^> = <!P» I !Pi>, 

■|‘U ' 
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we deduce from (3.47) that 
(3.49) 

<Pt I SWi> = <Wi | Vi> - lim 

e-»+0 





2ie(H — A 0 ) 
(H- \ 0 )* + e* 



In terms of the transition operator 

( 3 . 5 °) 

and provided that M + = M_, we can write (3.49) in the form 


(3.51) <P { | TVi} 

=; Jffl <' p ' IC irr^wh 

" ; is (*• I C d ^ H ~ w 


(H 0 - A 0 )* + 


'Pi 


) 


by taking advantage of the intertwining property (2.4). Using Lemma 
3.3, we immediately obtain from (3.51) 


1 / | /. + 00 

(3.52) <P, | T¥i> = i lim (P { | J ^ 


(H 0 - A 0 ) 2 + ^ 


P\ 


)• 


It is interesting to note that on account of the relations 

(3.53) Pi = ¥^"(0) = s-lim e iH<>t e~ iHt P_(0), 


(3.54) P t = ¥^ ut (0) = s-lim e iH<>t e~ iHt P + (0), 

t-*+co 

the vectors W t and ^ can be considered to be the asymptotic states of 
the interaction-picture states [see (3.43) of Chapter IV] 

(3.55) W ± (t) = e iHot e~ iHtx F ± (0), 

respectively. Thus, it can be said that (3.52) provides an expression for 
the transition amplitude (W t | TW^) from the incoming interaction- 
picture asymptotic state W i to the outgoing interaction-picture asymp¬ 
totic state W t . 

3.5. The Resolvent of an Operator 

In the Lippmann-Schwinger equations (3.34) and (3.41) we 
encountered operators of the form (A — A — where in the first 

case A stands for H 0 and in the second case A stands for H . Operators 
of the form (A — £) -1 , where £ is in general a complex number and A is 
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a linear operator in Jf*, play a very important role in functional analysis. 
Since they have been systematically studied, it is advisable to acquaint 
oneself with the results of such studies, and then apply these results 
to the special problems at hand. 

Definition 3*2* If A is any linear operator in the resolvent 
RJ£) of A is the operator-valued function 

(3.56) R a (0 = (A~C)~\ £eC\ 

defined at all complex values of £ at which (A — £) -1 exists. 

We note that, according to Theorem 4.2 of Chapter III, (A — £) -1 
exists if and only if the equation (A — £)/ = 0 has the unique solution 
/ = 0, i.e., if and only if £ is not an eigenvalue of A. 

We can classify the points £ in the complex plane C 1 in relation to a 
densely defined linear operator A as follows. 

Definition 33* The set of complex numbers £ consisting of all 
points £ e C 1 at which the resolvent R A (Q is a bounded operator defined 
densely in is called the resolvent set of A . The complement of the 
resolvent set is called the spectrum S A of A. The spectrum S A is the dis¬ 
joint union of the point spectrum Sp, continuous spectrum , and residual 
spectrum , which consist of all points which have the following 
respective properties: Sp of the points £ e S A at which R A (Q does not 
exist; of all £ e S A for which i?^(£) exists, is unbounded and defined 
on a dense subset of S A of all £ e S A at which R A (£) exists but it is 
not defined in a dense subset of 

In the case where A is a self-adjoint operator, the above definition of 
the spectrum coincides with Definition 6.2 of Chapter III. To establish 
this equivalence, we need to show that 

(3.57) a. M - H„ lTT Z T li*l 1 * W < +4 


If / belongs to the domain of (A — £) _1 , there is a vector g e 2& A such 
that f — (A — Qg. Consequently, we have 

L TT^TP '»'W = L TT^TP d ‘ ( 11 1' v - 5 '**•'' ^ “’) 


Conversely, if 


= / 1 A - g|2rf|1 E l g l|2 = [[g l|2 < +0 ° 
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Theorem 3*2* Let H and H 0 be two closed (not necessarily self- 
adjoint) operators in the Hilbert space having dense domains of 
definition Sj h = . The following second resolvent equations are 

satisfied by the resolvents R(Q and i? 0 (£) °f H and > respectively, 

(3.61) R(t) - R 0 (Q = -R 0 (Q(H - H 0 )R(Q = ~R(Q(H - H 0 ) R 0 (Q 

for any £ which belongs to the resolvent sets of both H and H 0 , i.e., 
for £ ^ U S^ 0 . 

Proof. Since H and H 0 are closed, i?(£) and R 0 (Q are defined on the 
entire Hilbert space whenever £ does not belong to the spectra of H 
and H 0 . Hence, for every there are vectors g e Sf H and h e 

such that / = {H — £) g = (H 0 — £)A. Since = @ Hq , g also belongs 
to , and therefore 

R 0 (£)(H - H 0 )g = R 0 (Q(H - H 0 ) *(£)/ 

is defined. Setting above (H — H 0 )g = [i/ — £ — ( H 0 — £)]£, we get 
the relation 

R 0 (Q(H - H 0 )g = R 0 (Q(H - Qg - g = R 0 (Qf - *(£)/, 

which is equivalent to the first of the identities (3.61). 

The second identity in (3.61) can be established by a similar 
procedure. Q.E.D. 

Before proceeding with the applications of the second resolvent 
equations, we shall derive, for the sake of completeness, the first resolvent 
equations. 

Theorem 33* If £ x and £ 2 belong to the resolvent set of the closed 
and densely defined linear operator A , then the first resolvent equations 

(3.62) R a (U) ~ = (£i - £ 2 ) ^(£ 1 ) ^(£ 2 ) 

are satisfied. 

Proof. The range of A — £ is identical to the domain of definition 
of R A (f) = (A — £) _1 , which coincides with vdien £ is in the 
resolvent set of A. Hence, for any fe Jf, a vector £ can be found so that 
f = (A — £i)(^4 — £ 2 )^, and therefore 

(3.63) (£ x - £ 2 ) R a (^) R A (C t )f = (£ x - U)g. 

On the other hand, we have 

(3.64) [R A (£J - R A (U)]f =(A- U)g -(A- = (£i - £ 2 )£. 

Since (3.63) and (3.64) hold for any /e^, (3.62) follows. Q.E.t). 
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Let us return now to our main task of giving a perturbational method 
for computing transition probabilities. 

Using Theorem 3.2 we infer that, if H 0 = H 0 *, H = H *, and 
2#h — ®h 0 > 

(3.65) R(i) = R 0 (t) - R 0 (Q 

whenever Im £ 0. After n successive iterations of the above relation, 

we obtain 

(3.66) R(Q = RS) t (-1)* (#1*0©)* + Rn , 

0 

where the remainder R n is 

(3.67) R n = (-1)-^ (RS) #i) n+1 *©• 

If the above remainder converges to zero, in some sense,* when n —> + oo, 
then (3.66) provides us with a means of computing /?(£) when /? 0 (0 
is known: 

(3.68) R(Q = RS) ~ *o(0 #i*o(£) + RS) H^S) H,RS) ± - . 

This result combined with (3.67) leads eventually to a “perturbation” 
solution of the Lippmann-Schwinger equations. This solution is obtained 
by substituting for R( A 0 + irj) = (H — A a — irj )- 1 in (3.41) the series 
on the right-hand side of (3.68). If the series so obtained for the integral 
in (3.41) converges, and the order of summation and integration can be 
interchanged, then W ± can be computed to any order. 

3.7. Appendix: Integration of Vector- 
and Operator-Valued Functions 

In the last two sections we encountered integrals, such as (3.5), which 
are special cases of Bochner integrals. The theory of Bochner integration 
is based on the following theorem. 

Theorem 3*4* Suppose ^ is a measure in the measurable space SC 
and that /(f), f e /?, is a vector-valued function on the measurable set R , 
assuming values in a Hilbert space #C . If (g |/(f)> is measurable on R 
for any g e and if 

(3.69) f ||/(f)||dM0<+oo, 

J R 


* See also Lemma 5.2 and Theorem 5.9 of this chapter. 
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then, according to Theorem 2.5 of Chapter IV 

(3.58) 

is defined. Moreover, g = [1/(^4 — 0]/ belongs to because 

L H ^ xM' = L TX^T '» Ein ' < +“■ 

Hence, by (2.25) of Chapter IV, 

(- 4 -Oi-V-O -T^xf - <» - 0 rh **if = f- 

i.e.,/is in the domain of (A — £) -1 . Thus, (3.57) is established. 

By comparing (3.58) here and (2.19) of Chapter IV, we see that the 
above argument also establishes that when A is self-adjoint, 

(3.59) = 

Hence, for self-adjoint operators, Definition 6.3 of Chapter III and 
Definition 3.3 of the point spectrum Sp of A coincide, since ^({f}) 0, 

£ e R 1 , if and only if £ is an eigenvalue of A, in which case (A —* £) -1 
does not exist. 

We shall prove now the equivalence of the two definitions (6.3 of 
Chapter III and 3.3) for the continuous spectrum of a self-adjoint 
operator. 

When, for a fixed real £, #*([£ - e, £ + e]) ^ 0 for all e > 0 
and -^({f}) = 0, we must have either E A ([£ — e, £)) =£ 0 or 
£^((£, £ + e]) ^ 0 for all e > 0. Let us say that the first alternative is 
the case. Then the equations 

E a ([C-€ 9 £))/ c =/ c , \\f € \\ = 1, 

can be satisfied for all e > 0. Inasmuch as 


l^(Q/J 2 = J 


fc—.o (A - 0 * 


II* = 


we conclude that i?^(£) is an unbounded operator. On the other hand, 
m) is defined on the dense set of all vectors g satisfying 
E A ([t — e, £ + e])^ = 0 for some e > 0. Thus, if £ is an element of the 
continuous spectrum in the sense of Definition 6.3 in Chapter III, then 
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it belongs to the continuous spectrum in the sense of Definition 3.3. 
The complete equivalence of these two definitions of S A for a self-adjoint 
operator A follows from Lemma 3.2. 

Lemma 3*2* The resolvent of a self-adjoint operator A is a 

bounded linear operator if and only if either Im £ ^ 0 or £ is real but 
E A ([£ — e, £ + e]) = 0 for some e > 0; in either case & Ra (c) = . 

Proof. We have seen earlier that if £ is real and E A {\t i — e, £ + e)] ^ 0 
for all e > 0 , then /?*(£) either does not exist or, if it exists, it is an 
unbounded operator. 

Assume now that £ is real and E A {{1 — e 0 , £ + e 0 )) = 0 for some e 0 > 0. 
Since the function l/(£ — A) is bounded on 

(-QO, £ — e 0 ] U [£ + e 0 , +oo), 

we conclude that 

edr?'»#»* - C * 11+ C.fd-E ? 1Ein ‘ 

< 1 f £_eo d iiitf/n 2 + A f dl II^ n* = h h/h 2 - 

€ 0 J -oo € 0 J C+e 0 € 0 


Hence, it follows from (3.57) that R A (£) is bounded and defined for 
all/ejsf. 

In the case where Im £ 0 , we have 


f _L 

J nji | A 




Im £ 


\ f d\\Eff \\ 2 

(TO 1 


ll/ll 2 

I Im £ | 2 ‘ 


The above inequality establishes that when Im £ 7 ^ 0, 
(3-60) 


and R al (£) is defined for all/e . Q.E.D. 

It follows from the above lemma that a self-adjoint operator has no 
residual spectrum. 


3.6. The Resolvent Method in Scattering Theory 

The two resolvents we encounter in time-independent scattering 
theory are the resolvent i? 0 (£) °f the free Hamiltonian i / 0 and the 
resolvent i?(£) of the total Hamiltonian H. However, while i? 0 (£) * s 
easy to compute in practice, /?(£) is an unknown quantity. Thus, 
Theorem 3 . 2 , which relates /?(£) and i? 0 (£), proves to be very useful. 
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then: 

(a) there is a unique vector h e called the Bochner integral of 
/(f) on R and denoted by 

f /(f) <Wf), 

J R 

which is such that for all g e 

(3.70) <g\h>= f <g\m>M0\ 

J R 

(b) if A is a bounded operator on Jf? and || Af(g )|| is measurable, 
then 

( 3 . 71 ) a f mMi) = f Am 

Jr Jr 

Proof . (a) The linear functional 

<Kg) = f <f(i)\g>Mi) 

J R 

is bounded on account of (3.69): 

\m<\ l</(f) l£>l Mi) <11*11 f II m\Mi)- 

Jr Jr 

Hence, by Riesz’ theorem there is a unique vector h which is such that 

<Kg) = < h \ g > for a11 s G 

(b) The Bochner integral of Af{£) exists since <g | Af(ij)y = 
< A*g | /(f)) is measurable for any g e and 

f II 4Af)ll Mi) < \\A || f ||/(f)ll Mi) < +«>• 

Jr Jr 

In order to prove (3.71) it is necessary and sufficient to show that 

(3.72) <g\Ah}= f <g\Am>Mi) 

J R 

for all g e Jtf*. Using the definition of the Bochner integral, we get 

<* I Ah > = <A*g | *> = f <A*g \m> MO, 

j R 

thus establishing that (3.72) is true. Q.E.D. 
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The Bochner integral for vector-valued functions can be used to 
define Bochner integrals for operator-valued functions A(£). As a matter 
of fact, if the Bochner integral 

a jin = f vmn 

J R 

exists for all f e then the mapping /-> A R [f\ is linear since 

(g I ^*( a i/i + ^2/2)) = f (g \ ^(£)( a i/i + ^2/2)) <Wf) 

J R 

= a, f (g | A(£)fo +a 2 \ (g \ A(£)fo d&) 

Jr Jr 

= “i<g I A R f L > + a 2 (g | A R f 2 y = <g | a x A R f x + a 2 A R f 2 > 
for all g e . Hence, 

(3.73) A r =\ A(S)driS) 

J R 

can be defined to be the Bochner integral of A(£) on R. 

It is useful to note that the integral in (3.73) has the property (see 
Exercise 3.9) 

(3.74) A x * = f A*(S)dp(S). 

J R 

Naturally, all of the above results can be immediately generalized 
to complex measures /jl by decomposing such measures into a sum of 
real measures. 

A certain kind of generalization of Bochner integrals has been 
encountered in (3.33), (3.34), (3.41), and (3.43). We define such integrals 
in a more general context by means of the following concept of 
integration. 

Definition 3*4* Let f x and A (A) be functions in A which assign to 
every A e [a, b\ a vector f x in and a bounded operator A( A) 
on respectively. Suppose that for any sequence of subdivisions 
a = A (0) < A (1) < ••• < A (n) = b of [tf, b\ with norm 

8 = max{| A (fc) — A (fc_1) | : k = 1,..., n} 

converging to zero, and that for any A ,(fc) e (A (fc_1) , A (fc) ] 

(3-75) Hm l (g | ^(A'«*>X/ aW -/,<*-„)> 

U k™ 1 
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exists for some fixed g e If this limit is the same for any choice of 
sequences of subdivisions with shrinking norm S -> 0 and for any 
choice of X ,{k) e(A (fc-1) , A (fc) ], then it is denoted by the integral symbol 

(3.76) (g [ £ A(A) dj^ . 

A corresponding improper integral is defined as 

(3.77) (g | A(A) dj ») = Km (g | £ A(A) dj^ , 

with a similar definition holding when a -> — oo. 

We encountered integrals of the type (3.76) in (3.31) and (3.39). In 
those cases, f x is of the form E x f y where E x is a spectral function and / 
is a fixed vector in . 

For any given interval I C R 1 , the functional 
Mg) = 

is obviously linear. If </>j(g) is defined for all g e and if it is bounded, 
then by Riesz’ theorem there is a unique vector hj which satisfies the 
equality (hj \ g} — <f>j(g) for all geJif. We denote this vector by the 
integration symbol 

(3.78) J^(AK/ a . 

Suppose K x is a operator-valued function. It is easy to see that 

jA(X) d.lMiaJ, + aj 2 )] = J^(A) d^KJ, + a 2 KJ 2 ] 

= a, f A( A) d x KJ x + « 2 j A (A) d A KJ 2 

and that the existence of the integrals on the right-hand side of the above 
relation implies the existence of the integral on the left-hand side. Hence, 
the mapping which takes / into 

Aj(J) = j j A(A)d,KJ 

is linear. We denote the linear operator A l by the integration symbol 

A,= ^A{X)d A K A , 


(3.79) 
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and refer to it as the weak* Riemann-Stieltjes integral over I of A( A) 
with respect to K x . If K x is a spectral function then (3.79) is called 
a (weak*) spectral integral. 


Lemma 3.3. Suppose that Ef is the spectral function of a self- 
adjoint operator A and that JP(A), A e U\ is a family of bounded operators 
such + that F*(A)/e 3 A for all / e ^ If || F( A) || < C for all XgU 1 then 

(3.80) f“V(A)A d x Etg = P^(A)^ d A Efg 9 ge® A , 

and the existence of one of these weak spectral integrals implies the 
existence of the other. 


Proof. We shall prove first that for any a < b y 

(3.81) (/| JV(A)(A-^I)^) =0, feJt, ge® A , 

by establishing that the corresponding Riemann-Stieltjes sum converges 
weakly to zero in limit S —> +0 described in Definition 3.4. Indeed, 


(3.82) 


// £ F(A'<*>)(A'« - A)(E^ - E^g) 


= t <«<*> - - A)F*{xy/\{Ey - Ey^y 

7c=1 


since — Efo- 1 , is a projector and therefore self-adjoint and equal 
to its own square and F*(\' {k) )f e 3 A . The absolute value of (3.82) 
is obviously smaller than 


t ii «<*> - Ey t) )( a' ( *> - j)f*(A' ( «)/h n« (t) - ii 

7c=»l 

{equation continues) 

# In defining (3.79) we used weak limits of Riemann-Stieltjes sums such as (3.75), 
but the limit might exist and yield the same operator also in other topologies, such as 
the strong topology (see PrugoveiSki [1972a] for a systematic treatment). 

f See Lemma 2.1 by PrugoveCki [1973a] for less restrictive conditions under which 
(3.80) is still true. 
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/ 71 \l/2 

< (E ll«cw - <<*-oXA ,tt) - ^*(A'<*>)/II 2 ) 

( » \l/2 

E ll«<*> -<<*-!>) * II 2 ) 

(i (A a) - A **- 1 *) 2 ((* | £ «<« - ^O*)) 




< 


(«i 


■) < 


(A<» - A«-«) C* ||/||* «*<1 - E^g')) 1 ! 2 


< swell/II (6 - ay 2 II (E* - ^)*|| < C(S - «) 1/2 II/II IkII 8W, 

where the second inequality follows by the Schwarz-Cauchy inequality 
on l 2 (n) and the third inequality follows from the following relations: 

ll(A' <fc) - A)(E^ - E^h || 


,A<*> 




f (A'(fc) _ A) 
J A a-i) 


dE?h 


< (A ( fc > — A (fc_l) ) ||(^ (fc) — Efuc^h ||, 

(E% - E^E^ - E^) = S kl (E% - 

Thus, we see that in the limit S —> +0, (3.82) approaches zero, and 
therefore (3.81) is true. In view of the obvious fact that 


/.+oo +V r n+l 

I F(X)(X — A) d h E*g = w-lim £ f F(X)(X — A) d. x E*g, 

N n=—N J n 


we can conclude not only that (3.80) is true, but also that the existence 
of one of the integrals in (3.80) indeed implies the existence of the 
other. Q.E.D. 

By using the Lemma 3.3 we can deduce the following general result, 
which in view of Theorem 3.1 immediately leads to the Hilbert space 
Lippmann-Schwinger equations (3.34) and (3.41). 


Theorem 3*5* If H and H 0 are any two self-adjoint operators 
having a common domain of definition in a Hilbert space then for 
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=F« I"™ e ±ct e im dt 

/»4-00 

1 - f (H - A 0 =F i*T\H - H 0 ) 

J—co 
/.-foo 

1 - f d x E”{H - HoX a - H 0 qF w)" 1 . 

oo 

Proof. By reviewing the derivation of (3.33) and (3.39) we easily 
establish that the only prerequisite has been the self-adjointness of H 
and Hq . If in addition @ H C S> Hq then (H — A 0 ± ie)~ x in the role of 
F(X 0 ) satisfies both conditions of Lemma 3.3 since by (3.60) 

\\(H-\ 0 ±te)-'\\^e-\ A 0 e R 1 , 

and obviously F(X 0 )f e S> H . Hence the equality of the expressions in the 
first and second line of (3.83) follows by applying this lemma to (3.39). 
The last of the equalities in (3.83) can be derived in a similar manner 
by taking adjoints in both sides of (3.33). Q.E.D. 

The most remarkable feature of the Hilbert space versions of the 
Lippmann-Schwinger equations (3.34) and (3.41) is their great generality 
due to very simple and modest assumptions required in Theorem 3.5— 
from which they immediately follow. Consequently, not only do these 
equations lead to Lippmann-Schwinger equations for distorted waves* 
in two-body scattering (see §8.7), but their applicability extends beyond 
the realm of quantum mechanics proper (see §4). It should be noted, 
however, that these equations become trivial identities if M ± consist 
of only the zero vector. This is exactly what happens even in two-body 
potential scattering if the potential is of long range, and in particular 
for the practically important case of Coulomb-like potentials. One can 
then impose, however, space cutoffs on the potential, thus recovering 
nontrivial Lippmann-Schwinger equations, and then study the behavior 
of these equations as the cutoff is removed (see §4.9 as well as Prugovecki 
and Zorbas [1973] for details and further references). Alternatively, 

# These were the equations actually derived by Lippmann and Schwinger [1950]. 
Hilbert space equations equivalent to (3.34) and (3.41) made their first appearance in a 
paper by Tixaire [1959] (and were independently derived and generalized by Prugovecki 
[1969, 1971b, 1973a]), whereas the Riemann-Stieltjes spectral integrals, which form their 
basis, were first introduced by Daletskil and Krein [1956]. In studying and applying these 
Hilbert space equations some authors (e.g., Amrein, Georgescu, and Jauch [1971], 
Chandler and Gibson [1973], Thomas [1975]) prefer using strong instead of weak spectral 
integrals, but then additional restrictions have to be imposed on H and H 0 . 


any e > 0 
(3.83) = 
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one can develop modified Lippmann-Schwinger equations (Prugovecki 
[1973a]) that can be reduced to a form that contains a new effective 
potential and can be then solved by iterative techniques (see Masson 
and Prugovecki [1976]). 


**3.8. Appendix: Scattering Theory in Liouville Space 

Scattering theory for density operators plays an important role in 
nonequilibrium statistical mechanics. * Through the use of the Liouville 
space formalism of §8 in Chapter IV we can immediately adapt the 
results of §§2-3 to quantum statistical mechanics. 

Indeed, a Liouville space ^(Jf) is itself a Hilbert space. Therefore, 
all the general theorems that we have derived in §§2-3 are as applicable 
to JSf(^) as they were to itself once it is established that their pre¬ 
suppositions are satisfied. Thus, the basic difference between quantum 
scattering theory in and statistical-mechanics scattering theory on 
jSf(^) stems from somewhat different physical asymptotic conditions 
in the two cases. The difference in these conditions reflects the difference 
between equations (1.14) and (8.66) in Chapter IV for state vectors 
and density operators, respectively. This leads to appropriate changes 
in the detailed form of (2.46) and (2.47), which in the case of density 
operators, requires trace operations rather than inner products. For 
example, if we work with the density matrices of a complete set of 
operators, the density operator counterpart of the asymptotic condition 
(2.46) states that, by (8.69) in Chapter IV, 


(3.84) 


p A i Am (E\ P Al Am (B) 

Pit) V ' P ex (0 v ' 

= f [<* I pMI *> — <* I p ex WI *>] dfi(x) 

J B 


should converge to zero as t-> ±oo. 

It is quite easy to establish that a condition such as (3.84) converging 
to zero for t — > ± oo is satisfied if 


(3.85) h-lim[/o(£) — p ex (t)] =0, ex = in, out, 


* See Prigogine [1962], Kirczenov and Marro [1974], and Balescu [1975]. A sample of 
specific applications can be found in the articles by Zwanzig [1963], Fano [1963], Snider 
and Sanctuary [1971], Tip [1971], Eu [1975], and Coombe et al. [1975]. The proofs for 
statements made in this section can be found in Jauch et al. [1968] and Prugovefiki 
[1972, 1978b]. 
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although the converse is by no means true. On the other hand, since 
the limit in Hilbert-Schmidt norm appearing in (3.85) is in fact a 
strong limit in and is itself a Hilbert space with an inner 

product that gives rise to that norm, the general part of Theorem 1.1 
is applicable in the present case so that we can state that (3.85) is satisfied 
if and only if 

(3.86) P ( 0) = h-lim £i(t) p ex ( 0 ), Si (t) = e mt e~ iHat . 

£->Too 

If we take into consideration Theorem 8.11 in Chapter IV and recall 
(2.1), we see that for any A e JSf(^), 

(3.87) S2(t)A = D(t) AQ*(t) 

(3.88) Q±AQ±* = h-hm Sl(t) E±A, 

where the superoperators E ± , 

(3.89) E±^4 = E m AE m± , 

are obviously idempotent, i.e., E ± = E ± 2 , and also self-adjoint in 

jSf(^T), 

< \A 1 | E±A 2 y 2 = Tr[^4 1 * J E M± ^4 2J E M± ] = Tr[ E M± A 1 *E M± A 2 ] = (E±A 1 1 A 2 ) 2 . 

Therefore E ± are super projectors in jSf(^). Consequently, (3.86) 
suggests the introduction of the wave superoperators 

(3.90) £l± = h-lim £l(t)E± , 

t-»T oo 

which by (3.87) we related to wave operators as follows: 

(3.91) £l±A = n±AD±* y A e -Sfpf). 

Starting with the definition (3.89) we can restate in a most striaght- 
forward manner every single theorem in §§2-3 in terms of and the 
scattering superoperator S, where 

(3.92) S P = = S P S *. 

In particular, we shall have by Theorem 3.1 

(3.93) = h-lim ft± e E± , 

e -»+0 

J .-I-OO 

e Tst e im e- <Hot dt, 

0 


(3.94) 
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and by the results of §§3.2—3.4, for H x = H — Hq and E ± = 1, 

(3.95) a ±e = 1 + r (A - H ± *«)-«! d,Efo 

J—co 

/.+00 

= 1 - I W-Hoi*? 1 , 

00 

thus obtaining a spectral integral formula for the transition super- 
operator 

(3.96) T = (27rt) -1 (l - S) = (2m)- 1 h-lmi(«* e - n^)n + 

= h-lim f + °° ~ A) 2 + . 

e^+0 J _ a0 

One of the advantages of the Liouville space approach to quantum 
scattering theory is greater generality. Indeed, if we define E ± as the 
superprojector onto the closed subspace of spanned by all 

p (ex) (0) for which (3.86) exists [rather than in terms of M ± by means 
of (3.89)], then we might have E ± = 1 even when Z£ M± = 0. For 
example, assume that M ± = for some choice of H 0 and H in (2.1). 
If we then shift the energy scale by adding a constant c =£ 0 to H, thus 
arriving at H' = H + c, we shall have that 

s-lim e ia ’ % e~ iHot f = e ict s-lim e iHt e~ iH H 

Too ipco 

obviously does not exist except if / = 0. Thus, although nonrelativistic 
theories should not be sensitive to shifts in the energy scale, since 
(nonrelativistically) only relative and not absolute energy values are 
measurable, the standard scattering theory in espoused in §§2-3 
displays a definite (and totally unphysical) dependence on the choice 
of origin of the energy scale of H in relation to H 0 . This weakness is, 
however, removed in the Liouville space approach, since then obviously 

JH't JHt -iU 0 t 

e e 0 p = e e u p 

for any density operator p, so that E ± are unaffected by such shifts in 
energy scale. 

An even more significant advantage of the Liouville space approach 
to scattering theory is its immediate applicability to classical statistical 
mechanics. This enables not only the application of techniques de¬ 
veloped in the quantum context to classical theory (see, e.g., Resiboia 
[1959], Miles and Dahler [1970], St. Pierre [1973], Leaf [1975]), but 
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through the use of the master Liouville space 3? r introduced in §8.9 
of Chapter IV, it makes possible the step by step comparison of classical 
and quantum effects when classical and quantum models for the same 
system are formulated in a common (see Ali and Prugovecki [1977b], 
Prugovecki [1978b, c], as well as §4.10). 


Exercises 

3*L Show that there are unique operators £?_ e ($) and Q +e (s), 
0 < € ^ s, which satisfy the relations 

<g I Q_ e (s)/y = € fe-<Kg I Q(t)/y dt, 

J 0 

<g I «*«/> = - f° e*Kg I W) dt 

J -s 

for all fyge Jf?, and that || fl ±e ($)|| < 1. 

3*2* Prove that 

Q± e = s-lim I^± e (s). 

i -»+00 

33 . Suppose that the Riemann-Stieltjes integral of .F(A) with 
respect to ct(A) exists in [a , 6], that .F(A) is Borel measurable, and that 
ct(A) is nondecreasing. Show that .F(A) is integrable with respect to a 
measure fi(B) which satisfies the relation /x((—oo, A]) — ct(A), and that 

fV(AKa(A)= f F(X) 

3*4* Derive from the basic Definition 3.1 that the integrals (3.18)- 
(3.20) represent linear operations; for example, that 

\ d\[ [^i^i(A, A 0 ) + a 2 F 2 (A, A 0 )] d x o( A, A 0 ) 

J a J I 

= «1 f d x f Fi(K Ao) d x o{\, Ao) + a 2 f d x f F 2 ( A, A,,) d A o{\, A„). 

J a J I J a J I 

33 . Prove that if the Riemann-Stieltjes integral on [#, 6], [a, c ] 
and [< c , b ] exists (a < c < 6), then 

fV(A) da(X) = f°F(X)da(X) + f F(X) da(X ) 9 

J a J a J o 


a < c < b. 
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Extend the proof to show that for cross-iterated integrals 


/.& /» /»c /» /»b /» /• /»c /• /»b 

J a ^ I J a J c J a J / J c 


Remark . If o(A) as well as .F(A) are discontinuous at c , the integral on 
[#, 6] might not exist, although the integrals on [a, c ] and [< c , rf] exist. 


3*6* Show that if o(A) is nondecreasing and -Fi(A) ^ F 2 (X) y then 
| Fj(A) ^ct(A) < f F t (X) da(X). 


Using this result, show that if o(A, A 0 ) is nondecreasing [i.e., o(A, A 0 ) ^ 
cr(A', A 0 ') whenever A < A' and A 0 < A 0 '] and F ± ( A, A 0 ) < F 2 (X , A 0 ), then 

J J ^i(A, A 0 ) d A a(X, XoX j A 0 ) d XQ a(\, A 0 ). 

3*1* Assuming that the respective Riemann-Stieltjes integrals for 
o 1 and o 2 exist, prove that the corresponding integrals for the linear 
combinations a 1 a 1 + a 2 o 2 , a ly a 2 e C 1 , exist and 


J^’(A) <4[«i ct i(A) + « 2 ct 2 (A)] 

= a x J^i ? (A) d^X) + a 2 j^.F(A) d h a 2 { A), 

\ d A ( F( A) rf Ao [«iffi(A, Aj) + a 2 o 2 (X, A„)] 

J i J i„ 

= a± f f F(X, A 0 ) 0 Oi(A, A 0 ) -f- a 2 f f F(X, A 0 ) d^ g 2 (X, A 0 ), 

J J h J J /<, 

where I and I 0 are arbitrary nondegenerate intervals in U 1 . 

3*8* Using the result of Exercise 3.6, prove the mean-value theorem 
for Riemann-Stieltjes integrals: 

If.F(A) is real and continuous, and if ct(A) is nondecreasing and bounded 
on [a, b ], then there is a point A' e [a, b ] for which 


\ b F( A) d x a( A) = F( A') f d x a( A) = F( A') a((a, ft]). 

•'a •'a 


Show that the above integral exists if F( A) and ct(A) satisfy the 
assumptions made earlier. 
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3*9* Prove that if (3.73) exists, then the relation (3.74) holds. 

3 JO* Prove that the resolvent R A (£) of a self-adjoint operator A 
can be represented for Im £ < 0 by a Bochner integral: 

r+<*> 

R a (() = —i\ e i{A ~ C)t dt. 

J o 


4♦ Eigenfunction Expansions in Two-Body Potential Scattering 
Theory 


4.1. Free Plane Waves in Three Dimensions 


In §3 we presented a very general framework for time-independent 
scattering theory. However, in the practically important case of potential 
scattering one encounters much more specialized versions of that 
framework, which are, on the other hand, more convenient to deal 
with from a computational point of view. To derive these formulations 
rigorously from the general framework of §3, we need the existence of 
eigenfunction expansions for the Hamiltonians H 0 and H . 

Before formulating the concept of eigenfunction expansions on a 
general level, we shall elucidate this concept by studying it in the special 
case of the kinetic energy (or “free”) Schroedinger operator H 0 on 
L 2 (R 3 ), defined by means of the differential operator form 


(4.1) 


H 0 ' = 



A = 


a 2 a 2 a 2 

dx 2 dy 2 dz 2 * 


If we consider Hq to be an operator acting on the space of all every¬ 
where twice-differentiable complex functions /(r), then Hq has eigen¬ 
functions <P k (r) 


(4.2) (H 0 '<P k )(r) = £ <*> k (r), <? k (r) = * 

associated with the eigennumbers 


(4.3) A(k) = k 2 /2 m. 

In physical literature the functions <£ k (r) are called free plane waves . 
However, these eigenfunctions are not square integrable on U s in the 
Lebesgue measure, and therefore they do not represent eigenvectors 
of H 0 , considered as a self-adjoint operator in L 2 (IR 3 ). 
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The family {@ k (r): k e U 3 } of eigenfunctions of Hq has some remarkable 
properties, which follow from Theorems 4.5 and 4.6 of Chapter III. 

According to Theorem 4.5 of Chapter III, if i[feL 2 (U 2 ), and if 
B ± , B 2 ,... are any bounded measurable sets, the functions (x* n */0( r ) 
have Fourier transforms U F (x B Jf), since for every n= 1,2,... the 
function Xb ( r ) *A( r ) integrable as well as square integrable with respect 
to the Lebesgue measure. On the other hand, if Un=i B n = then 
II Xb — II 0 when n —> +oo, and therefore, in view of Theorem 4.6 
in Chapter III, 

(4.4) U f i/j = s-lim U F (j( B ^)» 

n->+oo 71 

Let us introduce some convenient notation in Definition 4.1. 

Definition 4*1 ♦ Suppose F(x, y) is the kernel of an integral operator 
F from L 2 { <&, v) into L 2 (3T, p) 

(4.5) (Fg)(x) = f F(x,y)g(y)dv(y), 

and that is the family of all functions g(x) for which the integral in 

(4.5) exists. Then we shall write for some A eL 2 (^, v) 

(4.6) (Fh)(x) = l.i.m. f F(x,y) h(y) dv(y) 

j<& 

if the linear operator defined by (4.5) has an unique extension/ = Fh to 
A, i.e., if for any sequence g x , g 2 ,.*• G ^ converging strongly to A there is 
an unique / e L 2 (^ f p) such that 

(4.7) lim f | f{x) - (Fg n )(x)\ 2 dp(x) = 0. 

n-*ao Jap 

We note that if F happens to be a bounded operator on then by 
the extension principle of bounded operators (Chapter III, Theorem 
2.6), l.i.m. \F{x y y) h(y) dv(y) is defined for all heL 2 (&,v) and re¬ 
presents the extension of the operator F to A. In particular, if F is the 
Fourier-Plancherel transform U F , then 

(4.8) ip(k) = (t/ F ^Xk) = (2tt)- 3/2 l.i.m. f e-^(v) dr 

J u 3 

is well defined. Moreover, by Theorem 4.6 in Chapter III we also have 

(E*(B n mr) = (2n)~™{ e^(k)dk, 

J B„ 


(4.9) 
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and consequently 

(4.10) ^>(r) = (27 t) _ 3/2 l.i.m. f e^f(k) dk. 

The formal analogy of (4.10) and (4.8) with the expansion formula of 
a vector i/i e 


= £ <«» I <A> e„ , 

n 

in an orthonormal basis e k becomes obvious if we rewrite these two 
formulas in the form 

(4.11) = l.i.m. J ^>p(k) 0 k (r) dk, <p(k) = l.i.m. J 0 k *(r) i/>(r) dr 

and introduce the convenient notation 

(4.12) (0 k | 0> = l.i.m. J 0 k *(r) 0(r) dr = k). 

The fact that (0 k | /) is not related to the inner product in L 2 (R 3 ) is 
emphasized in this notation by the round bracket in (• | •). 

Due to such analogies, the relations (4.11) are referred to as eigen¬ 
function expansions of ifj( r) by means of free plane waves <2> k (r). 

It is important to realize that we can easily express the spectral 
measure E H *(B) of H 0 with the help of <2> k (r). Indeed, we recall that the 
momentum operators are essentially multiplication operators when 
they act on $(k), 

(4.13) (P<*ty)~ (k) = kj(k), (P<»ty)~ (k) = kji(k), (P<’ty)~ (k) = k$(k), 

so that the k variables are in this case identical to the momentum 
variables p. Since H 0 is a function of P, 

< 414 > H »-sr= J < p >’ 

we have, in the momentum space, 

(£*o(£)0)~(k) = XA _ 1{B) (k)f(k), 
or equivalently, in the configuration space, 

(4.15) (E H o(B)ifj)(r) = l.i.m. f 0 k (r) $k) dk . 

J a-Hb) 
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4.2. Distorted Plane Waves 

Let us now consider a total Schroedinger operator H defined by means 
of the differential operator form 

(4.16) H r = -(l/2m)A + V(r). 

In §§6-7 we shall extend to H r the above results on Hq by proving that, 
under certain assumptions on the potential F(r), the following state¬ 
ments are true. 

For each vector keR 3 — \ , there is a unique solution* 

<^ +) (r) = (2 tt)-s/ 2 (e*' + «k(r)) 


of the differential equation 

(4 - 17) - i A0 ‘ +)(r) + V(r) 0 * +)(r) = 0 * +>(r) 

for which, in spherical coordinates r, 9 , and <£, ^ k (r) behaves asympto¬ 
tically as follows: 

(4.18) v k (r) ~ MM1 e ikr f r = | r j +00> * = | k |, 

where f k (9 , cf>) is a function uniquely determined by ^ k (r). The family 
{<&<+>(!•): k e R3 — &*%} provides an eigenfunction expansion for any 
element eL^IR 3 ) in the sense that 

(4.19) ^+(r) = l.i.m. J* <2>< +) (r) <£(k) dk, 

J m 3 

where Ll c (K 3 ) = E H (S^) L 2 (U Z ) denotes the closed linear subspace of 
L 2 (R Z ) corresponding to the continuous spectrum of H. 

The function 0(k) appearing in the eigenfunction expansion (4.19) 
is not the Fourier-Plancherel transform of 0+(r), but rather the Fourier- 
Plancherel transform of another function related to by the equation 
In fact, one of the most important results (which will be 
derived in §7) of time-independent potential scattering is that Q + is in 
the present case a partial isometry with initial domain M 0 = L 2 (U 3 ) and 
final domain R + = L^ c (IR 3 ). Hence, to every e (IR 3 ) corresponds a 
unique 0 e L 2 (U 3 ) such that = Q + i/j. 


* i s an exceptional set of Lebesgue measure zero on which Fredholm techniques 
do not yield a unique ^^(r) (see Theorem 6.1). 
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In physical literature the functions 0£ +) (r) are referred to as the out¬ 
going (or retarded) distorted plane waves , while f k (9 , <f>) is called the 
scattering amplitude since, as we shall see in §6, it is intimately related 
to the function T (1) (p\ co, co') introduced in (1.31): 

(4.20) 

AH = - (2tt) 2 I k |-' T<1)(| k I; « , oj), « = (0 ,</,), O> 0 = (0,0). 

The justification of the term “distorted plane wave” is that in the 
special case when V(r) == 0 we have a k (r) == 0, and ^ k +) (r) becomes the 
free plane wave <P k (r). In addition, the function 

(4.21) &l +) (r; t) = 0[ +) (r) exp[-i(k 2 /2m)<] 

provides a solution of the time-dependent Schroedinger equation 

(4.22) i^ 0 k +) (r; t) = [-(l/2m)A + F(r)] ^ +) (r; t). 

The “distorted part,” *> k (r) exp[— i(k 2 /2m)t] of this wave describes, 
heuristically speaking, a process which recedes away from the scattering 
center r = 0. 

The functions 

(4.23) $1rV) = 0££*(r) 

are called incoming (or advanced) distorted plane waves for analogous 
reasons. They also satisfy (4.17), and for any gL^JR 3 ), we have the 
expansion 

(4.24) i/>_(r) = l.i.m. f <p£ -) (r)$k) dk, 

J R 3 

where ip_ = Qjp, with uniquely determined by i/j_ due to the fact 
that M 0 == L 2 (R 3 ) and R_ == L* c (R 3 ). 

A further significant result of §6, which in fact represents a generali¬ 
zation of (4.19) and (4.24), is that the spectral measure E H (B) can be 
computed onL^ c (IR 3 ) by means of ^^(r): 

(4.25) (E H (B)$ ± )( r) = l.i.m. f <P^(r) #k) dk, 4> ± = 

In particular, for B = R 1 , the relations (4.25) assume the form (4.19) 
or (4.24), respectively. It is important to realize that (4.15) is also a 
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special case of (4.25), since when V{t) == 0, we have Q + = Q_ — 1 
and L%JU B ) = L 2 (R 3 ). 

4.3. Free and Distorted Spherical Waves 

The free plane waves 0 k (r) are not the only eigenfunctions of the 
differential operator HJ,. It is easy to verify that the functions 

(4.26) 0 km (r, 0, ft = (2 M^Ukr) Y{\9,4) 
are also eigenfunctions of HI : 

(4.27) HI 0 Mm (r, e,4) = ~[-^r+- r ~-^u] 0 klm (r, 6,4) 

k 2 

2m 9 

^ 4 ‘ 28 ^ = sTnl [lid Sm 6 ~m\ + sin 2 0 dfi ’ 

for any values 

(4.29) 0 ^ k <c -j-oo, / = 0, 1, 2,..., m = —/, —/ -f- 1,..., -j-/. 

The function & klm (r, 9, <f>) is called an outgoing free spherical or partial 
wave . It follows from (4.10) and the relation (7.70) in §7 that the family 
of all functions (4.26) provides us with an eigenfunction expansion: 

(4.30) 4( r)=£ £ l.i.m. C 0' kl Jr)$Uk)tf dk, 0' klm (r) = 0 klm (r, 9,4), 

1 =0 ra=-Z J 0 

where, using the notation Q v introduced in (7.10) of Chapter II, we have 

(4.31) $ lm (k) = l.i.m. f 0(r, 9 , c/>) r 2 sin 9 dr d9 d<j >. 

% 

In the case that V(r) is spherically symmetric, i.e., V(r) = V 0 (r), the 
differential operator H r has the eigenfunctions 

(4.32) 0lZ(r, 0,4) = (2lnfVR^\r) Yr(0,4). 

These eigenfunctions are called, respectively, outgoing and incoming 
distorted spherical waves and are generalizations of ; i.e., 

(4.33) HVfiUr, 0,4) = [- — A + V n (r)) l(r, 9 , 4 ) = £- », 4 ), 
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and for any e L%f IR 3 ) 


(4.34) <A ± (r) = £ £ f *&) &.(*) ** d*. 

i=0 ra=-J J o 

where ifi ± = and is given by (4.31). 

It is well known (see Butkov [1968, Section 9.10]) that the asymptotic 
behavior for large r of the spherical Bessel function j\(kr) is 

(4.35) ji{kr) ~ (1 /kr) sin(Ar — Itt/I), r —► +oo. 

We shall see in §7 that the functions Rj^\r) have a similar asymptotic 
behavior 

(4.36) ~ (1 M sin (kr - (hr/2) + 8 t (k)). 

In physical literature 8 t (k) are called phase shifts , since they represent 
the change of phase from the free spherical waves corresponding to the 
noninteraction case V(r) = 0, to the distorted spherical waves. We shall 
see in §7 that they play a crucial role in the computation of the S operator, 
which can be written in the form 

(4.37) (S^Uk) = expp/8^)] $ lm (k). 

The function S t (k) = exp[2fSj(&)] is called the partial-wave S matrix. 
The reason for this terminology is that if we use Dirac’s bra and ket 
notation to write | kirn) = & klm , then taking in (4.31) and (4.36) 
p = 0 klm and working formally, without paying too much attention 
to the real mathematical meaning of these expressions, we get 

(k'l'm' | S | klm) = [kk'Y 1 8(*' - k) h Vl 8 m ' m exp[2* S ? (&)]. 

In §7 we shall establish the following generalization of (4.34): 

(4.38) TOf)(r) = f J )hjk)k*dk, 

1=0 m=—l J Aq X (B) 

where 

(4.39) A 0 (k) = A 2 /2 m. 

These formulas can be recast in a more compact form if we introduce 
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the measure /u. 0 = X /u (w) on ^ s , where 

= f k 2 dk, B 1 e@\ 

Jb 1 

(4.40) 

= £ X **(/,«), S 2 e^ 2 - 

i=0 w =—l 

It is easy to verify (see Exercise 4.1) that, written in terms of the measure 
/x 0 , (4.38) becomes 

(4.41) 0±)(r) = l.i.m. f ^ifi(r) $(k, l, m) d^{k, l, m), 

J a^Hb) 

where Z, m) and ^tm( r ) are arbitrary extensions of the functions 
$i m {k) and ^^( r )> respectively, outside the support of the measure 
fji Q and to the whole of R 3 . In particular, for H = H 0 , we have 

(4.42) (E H %B)4,)(r) = l.i.m. f 4> klm [v) f(k, l, m) d, H {k, l, m) 

J AqHb) 


as a special case of (4.41). 

4.4. Eigenfunction Expansions for Complete Sets of Operators 

The free plane and spherical waves introduced in the preceding two 
sections set the pattern for both the mathematical and physical meaning, 
as well as the role, of free (or unperturbed) eigenfunction expansions in 
scattering theory*: in both cases we are dealing with integral kernels 
of unitary operators U appearing in Definition 5.2 of Chapter IV when 
executing the transition from the configuration representation space 
L 2 (R 3 ) (namely the spectral representation space of position observ¬ 
ables) to the spectral representation space L 2 ((R n , (jl) of another complete 
set of observables, namely {P {x \ P {y \ P {z) } and {| P |, | L |, L (z) }, re¬ 
spectively. Indeed, in the first case L 2 ((R™, p) is the momentum re¬ 
presentation space L 2 ((R 3 ) of wave functions $(k) on which P {x) , P {y) , 


* More abstract approaches based on rigged (Gel’fand and Vilenkin [1968]) or equipped 
(Berezanskii [1968, 1978]) Hilbert spaces are feasible (see, e.g., Antoine [1969], PrugoveiSki 
[1973b], Ascoli et al. [1978]), but on one hand a definitive treatment is still pending, 
whereas on the other hand the more elementary approach adopted in this section suffices 
in potential scattering and is also close to computational techniques actually used in 
practice. 
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and P {z) act in accordance with (4.13), whereas in the second case 
L 2 ([R n , /x) equals L 2 ([R 3 , /x 0 ) since 

(4.43) (| P | MTJk) = k^'Jk), (L< 2> >/>)? m (k) = n4 lm {k), 

and for L' in (7.44), Chapter IV, 

(4.44) = l,p lm (k), U = L\L' + 1). 

We shall generalize now the above features of free plane and spherical 
waves into a concept of eigenfunction expansions that is applicable to 
all other choices of complete sets of observables in two-body as well as 
in multichannel scattering theory. 

Definition 4*2* Let {X 1 ,..., X m } and {Y 1 ,..., Y n } be two complete 
sets of operators in L 2 ([R m , /x) (in the sense of Definition 5.2 in Chapter 
IV). We shall say that <& y (x) is an eigenfunction expansion for {Y 1 ,..., Y n ) 
in the spectral representation space L 2 ((R m , /x) of {X 1 ,..., X m } if there is 
a spectral representation space L 2 ((R n , v) for {Y ± ,..., Y n } and a unitary 
transformation U of L 2 ([R m , /x) ontoL 2 (IR n , v) such that 

(4.45) f(y) = (Uf)(y) = l.i.m. f *,*(x)f(x) d^x), /x), 

(4.46) /(*) = {U-i){x) = l.i.m. J <t> v (x)f(y) dv(y), /eX*(R* v), 

(4.47) {UY j U-'f){y)=yJ{y), y = (y, ,...,y n ), /=1,...,«. 

Clearly, the free Hamiltonian H 0 is in potential scattering a function 
of the complete set {. P {x) , P iy) , P {z) } as well as of {| P |, | L |, L {z) }, as seen 
from (4.14) and (4.39): 

H 0 = |P \ 2 /2m = A 0 (\ P |). 

Hence, in general we can expect that the free Hamiltonian (or, in the 
language of perturbation theory, the unperturbed operator) shall be a 
function 

(4.48) H 0 = A(Y 1 ,..., Y n ) 

of the complete set {Y ± ,..., F n }, so that by Theorem 5.3 in Chapter IV, 
(UH n U~i)(y) = A(y)/(y). 

Consequently, by (4.46) we arrive at the following generalizations of 
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(4.15) and (4.42): 

(4.49) (E H %B)f)(x) = l.i.m. f ® y (x)f(y) dv{y). 

J A- 1 (B) 

As partial isometries with final domains R ± , the wave operators . Q ± 
can also be viewed as unitary transformations of M ± onto R ± treated 
as Hilbert spaces in their own right. If Y ± Y n leave M ± invariant, 
then so do 

(4.50) Yf> = j = 1,..., n, 

leave R ± invariant, and from Theorem 5.1 in Chapter IV it can be 
easily inferred that {Fj^,..., Y^} constitute complete sets of operators 
in R ± . By (2.4), (2.11), and Theorem 2.5 in Chapter IV, 

(4.51) = A(Y^\..„ Y^) = Q ± A(Y t ,..., Y n )Q ± * 

are the restrictions of H to R ± . Thus, if eigenfunction expansions 
0 { ^ z) (x) of {Y^,..., Y^} in the spectral representation space L 2 (U m , /jl) 
of {X x ,..., exist, so that in accordance with Definition 4.2, 

(4.52) 

t\y) = ( U (±) g)(y) = l.i.m. J &f y \x) g(x) djx{x), ge R ± C L*(R», /x) 

(4.53) 

*(*) = (£^-*^)(*) = l.i.m. J #^(*) t\y) My), e f/M± , 

(4.54) (= yi i m (y), / = 1.....«, 

where* [/ (±) £ = £/&*_*£ for all ^gR ± , then the argument leading to 
(4.49) in the present context yields 

(4.55) (E H± (B)g)(x) = l.i.m. f 0? (x) t\y) dv(y). 

j a~Hb) 

By setting above g = f ± = Q ± f, so that g {±) = /, we arrive at the follow¬ 
ing generalization of (4.25) and (4.41): 

(4.56) (E H± (B)f ± )(x) = l.i.m. f *?>(x)f(y) dv(y). 

J A~ 1 {B ) 


* In the notation introduced in (4.12) and (4.81), this condition ensures that | g> = 
(<Py\Q±*gy, so that in a sense (cf. Prugove£ki [1973b]) = (&£*'•■)#„, where 

Q±* + is the extension of Q± to a larger space JT+ D 
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Although in perturbation theory terminology 0 y (x) and 0\f\x) 
represent the eigenfunction expansions for the unperturbed and per¬ 
turbed problem, respectively, motivated by their physical interpreta¬ 
tion, the common names in physical literature are as follows: 0 y (x) is 
called a free wave , 0 { f\x) an outgoing (or retarded) distorted wave , and 
0 { ~\x) an incoming (or advanced ) distorted wave. 

4.5. Green’s Operators and Green Functions 

There is an intimate relationship between the spectral measure E A (B) 
of a self-adjoint operator A and what in physical literature is called the 
Green’s operator 

G a {Z) = -R a (0 = (£ - A)-\ {EC 1 -S', 

i.e., the negative value of the resolvent in resolvent set of A. Part of 
that connection will be investigated in §5.8, whereas in the present 
section we shall concentrate on those aspects in which eigenfunction 
expansions play a crucial role. 

To make our remarks immediately applicable to both the case of H 0 
as well as H in potential scattering, let us be general and assume that A 
acts in the spectral representation space = L 2 ( R m , fi) of a complete 
set {X x ,..., X m }, which we can always decompose into the direct sum 

(4.57) 34? = 0 = E A {Sg)Jf, J^ C A = E A ( Sc )^. 

The restriction A p of A to J^ p A is an operator with a pure point spectrum, 
and therefore the considerations of §5.1 in Chapter III apply to it, so 
that on 3f v A 

n\ 

(4.58) E A P (B) = E A (B) E A (S A ) = E A (B nS A ) = £ £ |/ A . a ></ A , I, 

AeflnSp" 4 a=l 

where {/ A a : a = 1,..., n A } is an orthonormal basis in E A ({ Con¬ 
sequently, by (3.59) 

(4.59) </1 G a (Q E A (S A )g) = £ £ < X I . 

P AeS p ^ a=l 5 ~ A 

Expressing the L 2 ((R m , / jl ) inner products on the right-hand side of 
the above relation in terms of integrals and introducing 
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we can rewrite (4.59) in the form 

(4.61) </1 G^O E A (S A )g) = f d,,(x)f*(x) J d,,{x') G Ap (x, Qg(x') 

if we assume that the orders of summation and integration can be 
interchanged. 

To arrive at a similar formula for the continuous spectrum of A , 
we shall assume that the restriction A c of A to Jf? c A is a function F(Y 1 ,..., 
Y n ) of a complete set {Y ± ,..., Y n } that possesses an eigenfunction ex¬ 
pansion O y (x) on L 2 ((R m , jj). Thus, by Theorem 5.3 in Chapter IV 

(U(£ - A c )-W~^)(y) = g{yM -F{y)\, 

and consequently according to (4.45) and (4.46) for / and g from a 
dense set F (cf. Definition 4.1) in L 2 (U m , /jl ), we shall have 

(4.62) <f\G A (QE^) g y 

= f Hy)^ - F(y)}- 1 \ dfM(x)f*(x)& y (x) f d^(x f ) g{x f ). 

J U n * , 0S m 


Assuming that the function 

(4.63) G Ac (x, x'; Q = l.i.m. dv(y) 

exists and that orders of integration can be interchanged in (4.62), we 
finally obtain from (4.61)—(4.63): 

(4.64) </ 1 G A (0g> = J d^x)f*(x) J d^x') G a (x, x'; Q g(x'), 

(4.65) G a (x, x'; Q = G Ap {x, x'; Q + G„ c (x, x'; Q. 

The fji X ^-almost everywhere unique (see Exercise 4.2) function 
G a (x , x £) that satisfies (4.64) for /, g eL 2 ([R m , p) is called the Green 
function of A on the spectral representation space L 2 ((R m , p). In Theorem 
5.1 we shall explicitly compute the Green function G 0 (r, r'; £) of 
the free two-body relative motion Hamiltonian H 0 in the configuration 
representation space L 2 ((R 3 ). In physical literature G 0 (r, r'; £) is called 
the free Green function. As we shall see in §5.3, the full (or total) Green 
function G(r, r'; £) corresponding to H can be computed from the 
second resolvent equation (3.61) rewritten in terms of Green functions 
[see (4.76)]. 
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A closely related concept is that of retarded and advanced Green 
function G { A \x y x'; A) and G ( A \x y x f ; A), respectively, defined for 
A e S 0 A . If they exist, these two functions by definition have to satisfy 
the respective conditions 

(4.66) f <W*)/*(*)f djx{x') Gf\x, x';X)g(x') 

= lim dfjb(x)f*(x) dfjb(x') G A (x y x'; A i ie)g(x f ) 

6 >±0 

for at least a dense set of values/, g eL 2 (IR m , /x). 


4.6. Lippmann-Schwinger Equations for Eigenfunction 
Expansions 

By taking advantage of the fact that, as we shall establish in Theorem 
5.1, the free Green function G 0 (r, r'; £), feC 1 - [0, oo), is a known 
quantity in two-body potential scattering, and that the corresponding 
advanced and retarded Green functions Gi ±) (r, r'; A), A e [0, oo), exist, 
we derive from (3.34) the Lippmann-Schwinger equations 

(4.67) <P^(r) = <P k (r) + f Gf>(r, r'; k 2 /2m) V(r’) <p£V) dr' 


for the distorted plane waves introduced in §4.2. As we shall see 

in §6.2, (4.67) provides a viable method of computing ^i ±) (r) for a 
wide class of short-range potentials. 

First of all, we note that by Exercise 2.7 (and more generally by 
Theorem 7.1 in §7), M ± = L 2 (U 3 ) for the considered class of potentials, 
which includes all short-range potentials of practical interest (see 
Appendix 4.9 for long-range potentials). Hence, we can rewrite (3.34) 
in the form 

(4.68) <g\f±}-<g\f> 

= lim (g | J + J G 0 ( A ± U)V d x E»f^ , f ± = Q ± f, 

where by (4.11) and (4.19) 

(4-69) Q \f ± -/> = f drg*(r) f dkf(k)[0^(r) - <P k (r)]. 
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The key idea in the derivation of (4.67) is to show that 
(4.70) (j-1 J + J G 0 (A ± U)V W/±) 

= f drg*(r) f dkf(k) f rfr'G 0 (r,r';k 2 /2m±t e )F(r')^V) 

*^05 3 •'(R 3 *'os 3 

for all functions ^(r) e ^^(U 3 ) and /(k) e ^V° C ^ b 0 (IR[ 3 ). ^V° consists, 
by definition, of all continuous functions whose compact support is 
disjoint from an exceptional set \ (assumed to be of Lebesgue measure 
zero—see §§6.2 and 6.5) of values k e U 3 at which ^{^(r) might not 
exist, and is such that for ^(k) e^V°, (4.19) and (4.24) become 

Ur) = (<2±<A)(r) = f *F(r)fo)dk. 


Then, upon taking in (4.70) the limit e -> -j-0 we can argue that the 
outcome can equal (4.69) if and only if the square bracket in (4.69) 
equals the last integral in (4.70). 

Indeed, by (4.25) and Definition 3.4, for any a , b eU 1 , a < 6, 

(4.71) (g | £ G 0 (A ± u)V d x E h H f±^ 

= £ f drg*(r) f rfr'G 0 (r, r'; A/ ± u) V(r') 

>U j =1 ‘'dS 3 * / 0S 3 

X f db0^(r')/(k), 

J A i _ 1 <k a /2m<A i 

where the notable fact is that Ay' e (Ay_ x , Ay] so that, heuristically speaking, 
Ay' and k 2 /2m merge together in the limit S -> 0. On the rigorous level, 
we have to assume that ®k fc) (r) is uniformly bounded in r when k is 
restricted to supp /, which is compact. This condition is certainly 
satisfied by <Z> k (r), and m §6 we shall establish that it is also satisfied 
by 0[ ±) (r) for a large class of potentials.* The potentials in that class 
are locally square integrable and of faster than r~ 3 decrease as oo, 
so that considering the form (5.6) of the free Green function we see 


* This will be achieved by solving the Lippmann-Schwinger equation (4.67), yet the 
reasoning is not circular. The procedure is rather analogous to, say, the establishing of 
power series solutions for some ordinary linear differential equations (see Ince [1956, 
Chapter XVI]), whereby one starts by assuming the existence of such solutions and upon 
computing the coefficients, one proves that the assumption was justified. 
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that the interchange in orders of integration in r' and k can be carried 
out in (4.71) due to Fubini’s theorem (see Exercise 4.3). Furthermore, 
Lebesgue’s dominated convergence theorem can be invoked to establ¬ 
ish that the right-hand sides in (4.70) and (4.71) are indeed equal, 
since upon taking the sum under the integral sign in (4.71) the resulting 
integrand is majorized in r, r', k e IR 3 by the integrable function 

(4.72) const. | £(r)| | V(r')\ \ r — r' I" 1 |/(k)| exp[—(2/we) 1 / 2 | r — r' |]. 

Finally, upon inserting the outcome into the right-hand side of (4.68), 
the same theorem can be applied again (see Exercise 4.3). 

Obviously, the main ideas behind the derivation of (4.67) are not 
confined to position and momentum observables, but are equally well 
applicable to arbitrary eigenfunction expansions 0 y (x ) (see Exercise 
4.4), and in particular to partial waves, when the role of {Y 1 ,..., Y n } is 
played by {| P |, | L |, L {z) }, and that of {X x ,..., X m } by the position 
observables {| Q |, ® Q , <P Q } corresponding to the spherical coordinates 
(r, 0 , <^). If in this instance the potential is spherically symmetric, we 
easily arrive at the following partial wave Lippmann-Schwinger equations , 

(4.73) <pg l(r,0,</>) = <P klm (r >(>></>) 

+ f Gi ±U (r, r'\ k 2 /2m) V(r') 0$,(r', 6, <j>) r' 2 dr'. 

J o 

where GQ ±)l (r , r') is given in (5.22). 

The solution-type Lippmann-Schwinger equations (3.41) in Hilbert 
space also have their counterparts for eigenfunction expansions. For 
example, duplicating the argument leading to (4.67) with (3.41) rather 
than (3.34) as a starting point, we obtain 

(4.74) <P^(r) = 3> k (r) + f G (±) (r, r'; k 2 /2m) V(r') 0 k (r') dr'. 

•’os 3 

The only difference is that in the present derivation we have to make 
assumptions about the full retarded and advanced Green functions 
G (±) (r, r'; A), rather than about In §5.4 we shall see, however, 

that these functions can be computed by means of the second resolvent 
equation (3.65), which can be written in the form 

(4.75) <g | G(0/> = <g I G 0 (t)/> + <£ | G 0 (£) ff^t)/), figs*. 

Indeed, upon expressing these inner products on the spectral representa¬ 
tion space of, in general, a complete set {X x ,..., X m }, and considering 
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that (4.75) holds for all /(#), g(x) e ^ b 0 ([R m ), we infer that 

(4.76) G(x, x'\Q = G 0 (x, 0 + f G 0 (x, x"; V)[IlfG(x", x'\ £)] 

As we shall see in Theorem 5.6, in two-body potential scattering the 
equation (4.76) can be solved by the Fredholm method. Combined 
with (4.74) this method provides a means for the computation of 
distorted waves that is totally equivalent to the method based on solving 
the Lippmann-Schwinger equations (4.67). 


4.7. The On-Shell T-Matrix 

Upon introducing the spectral integral 


(4.77) 


1 /*+°° 
t< = - I d,F“°n v 
77 •'—00 




(Ho - A) 2 + e 2 


l r + “ 

77 


L^lirr 


-Q, 


(H-X) 2 + € 2 +i 

we see that (3.52) is equivalent to the simple statement that 
(4.78) 


T = w-lim . 

e ->+0 e 


To recast (4.77) in terms of eigenfunction expansions, we note that 
by (4.48) and by Theorems 2.3 and 5.3 in Chapter IV, 


(4.79) (g | E H \B)f ) = f i*(y)/(y) dv(y). 

j a~Hb) 

Upon generalizing the notation in (4.12), we write 

(4.80) <£ | E H °(B)f> = f (g ( <P y )(<P y | /> dv{y\ 

(4.81) (<P y | /> = l.i.m. f 0 y *(x)f(x) d^x) = </| *,)*, 

or using a simpler, symbolic notation, we have 

(4.82) E h \B) = f \0 y )dv(y)(0 y \. 

j a~Hb) 

Thus, proceeding formally, we see that (4.77) can be written as 

(4.83) t. _ IJI«.) I + ,, ■ 
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The exact meaning of this expression follows from the next general 
result. 

Lemma 4*1 ♦ Suppose that F(X) is a strongly continuous operator¬ 
valued function for which ||F(A)|| < C, that is ^-square 

integrable in x e R m , and that H 1 x 0 y eL 2 (IR m , /x) represents a strongly 
continuous vector-valued function in y e S C U n . If A(y) is continuous 
and g(y) e ^ b °(U n ) then 

(4.84) (g I \ +CO diEpHJMf) = f £*( | F(A(y))f} dv(y). 

\ I *'—00 / •'ggn 

Proof. In view of the strong continuity of F(A(y)) and , 

(4.85) (H&y | F(A(y))f} = f [Hf0 y *(x)][F(A(y))f](x) d^x) 

J u m 

is continuous in y, so that the integral on the right-hand side of (4.84) 
certainly exists if g(y) e ^ h °(U n ). On the other hand, by (4.82) and 
Definition 3.4, 

(4 - 86) (g\f a d * E "° n imf) 


= lim 

< 5>0 


n p 


j =l j-i<A(y )Oj 


<£ I *,) dv{y){<P v I H 1 F(X/)f ) 


and this limit can be shown (see Exercise 4.5) to equal that same integral 
when we let a —> — co and b -> + oo. Q.E.D. 

Under the restrictions imposed in §6 on V(r) (see also Exercise 4.3), 
the conditions of Lemma 4.1 are satisfied, so we can certainly write 


(4.87) <£ | Tjy 


l ~j <g l® k )( 

77 •'in 3 \ 


V0 k i a. 


Iff, - (k 2 /2«)]• + e 2 




dk 


whenever £(k) e ^ h °(U 3 ). By (4.19) for /(k') e ^ b °(IR 3 — Sf v z ), 


(q 


+ 


€ 

[H, - (k 2 /2m)] 2 + 




e /(k') 

[(k' 2 /2m) - (k 2 /2m)] 2 + e 2 


dk' 


provided that (as shown in §6) < Z>k +> ( r ) has the same continuity and 
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asymptotic properties as 0 k (r). Thus, under the same conditions, 


(4.88) 


( 


V0 b I Q 


-s. 


+ [H 0 - (k 2 /2m)] 2 + 

«<M/> 




R * dk [(k' 2 /2m) 


(k 2 /2m)] 2 


■f drV(r)0 k *(r)0^(r), 

m3 


since orders of integration can be interchanged by Fubini’s theorem 
if we assume the integrability of V(r) on R 3 , as that assures the existence 
of the following integral for k e R 3 and k' e U 3 — : 

(4.89) | F | 0<t>) = f <P k *(r) F(r) <p£H(r) dr. 

*^05 3 

The matrix-type notation on the left-hand side of (4.89) is a variant 
of the Dirac-type of notation that is in very wide use in physical literat¬ 
ure, and the function of k, k' e R 3 given by (4.89) is referred to as a 
transition matrix or T-matrix. By using this expression and (4.88), and 
inserting the outcome in (4.87), we get 

(4.90) 

<* i Tjy = /_*<, | j^k' 8.(J^ - ■£) <«, l v l 1 />, 

where we have introduced the abbreviation 


(4.91) 8 e (A - A') = tt-i{€/[(A - A') 2 + €«]}. 

In (4.90) we can then explicitly take the limit e -> +0 whenever the 
T-matrix (4.89) is continuous in k e IR 3 and k' e R 3 — 5^ 3 so that the 
following lemma is applicable. 

Lemma 4*2* If the function/(#) is bounded on any finite interval 
in [0, +oo) (or (— oo, 0]) and such that its limit /(+0) from the right 
[or/(—0) from the left] exists at the origin, and if S e (#)/(#) is integrable 
on [0, +oo) (or (— oo, 0]) for any 0 < e < e 0 , then 

/»±CO 

(4.92) lim (2 /tt) j # [<r/(* 2 + <r 2 )]/(x) dx = ±/(±0). 

Proof. To prove the lemma for the first case of integrals over [0, + oo), 
choose for given rj > 0 some N(rj) > 0 so large that 
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and also such that whenever 0 < e ^ e 0 , we have 

8 e (*) = elir(x 2 + e 2 ) < e 0 lir(x 2 + e 0 2 ), x > N(rj). 
Consequently, for any S such that 0 < 8 ^ N(e) we have 

fWn) | e NM 

8 e (x)f(x) dx < « f [| f(x)\lx 2 ] dx < AT(ij) Mfo)( e /S 2 ), 

•'o •'o 

where M^) = sup{|/(#) | : 0 < x < N(rj)}. Since 


[el(x 2 + e 2 )] dx = arctan(S/e), 

•'o 

we obtain by using the generalized mean-value theorem of integral 
calculus 


/(+0) 

2 


r N(v) 

h,(x)f(x)dx 

J o 




/(+0) 

2 


arctan * | + N( V ) M( v ) ~, 


where inf{f(x) : 0 < x < 8} < y(e, S) < sup{/(x) : 0 < # < S}, so that 
y(e, S) ->/(+0) uniformly in e as S -> +0. Thus, both terms on the 
right-hand side of the above inequality can be made smaller than rj by 
choosing some appropriate S = S(rj) f then making e/S(^) sufficiently 
small, and taking advantage of 2 arctan x approaching tt as x -> + oo. 

The proof for the case of/(—0) runs along similar lines. Q.E.D. 

We note that if f(x) is actually continuous at the origin, then 


/.+00 /. + 00 

(4.93) Hm J »«(*)/(*) dx = /(0) = j »(*)/(*) dx. 


where, following a widespread custom, we have employed Dirac’s 
S(#) symbol, albeit this so-called ^-“function” is interpreted in mathe¬ 
matical literature as a generalized function or distribution,* whereas 

(4.93) is valid even when f(x) is not a so-called test-function. This 
notation has enabled us, however, to formally combine (4.89) and 
(4.90) by writing 

(4.94) Q 1 Tfy = J r(k) s(-£- - -£") < k I T I k '>/( k ') dk dk '> 


# See Dennery and Krzywicki [1967] for an elementary introduction, and Gel’fand 
and Vilenkin [1968] or Treves [1967] for a systematic treatment. 
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where, again according to widespread custom, we have introduced 
the following Dirac-type notation for the T -matrix (4.89): 

(4.95) <k | T | k'> = (3> k | V | <#>) = (3> k | VQ + \ 3> k -). 

In a rigorous approach to deriving (4.94) we can make in (4.40) the 
transition to spherical coordinates for k', and afterwards apply (4.93) 
to the variable k' = | k' | when taking the limit e -> +0. The easily 
computable outcome for £(k) e < ^ 7 b °(IR 3 ) and /( k) e ^^(lR 3 — is 

(4.96) <£ I Tf} = m f dkg*(k)k f da>'<k R | T | *R'>/(*R'), R = kjk, 

where R' is the unit vector in direction of k', and it has the angular 
spherical coordinates co' = ( 9 ', </>'). A comparison of (1.31) and (4.96) 
reveals that 

(4.97) T^(k; a), a/) = mk(kk I T \ kb'} 

where oj and oj' are the angular spherical coordinates of R and R', 
respectively. Consequently, the formula (1.47) for the differential 
cross section a(p —> p') of particles of (relative to the target) momentum 
p scattering off a planar target in the direction of the unit vector 
(with angular spherical coordinates c o') assumes the form 

(4.98) cj(p -> co') = a(p 2 /2/«, w') = {frrfnfi |<pco' | T \ p>| 2 . 

Due to the relation (6.67) (to be established as part of Theorem 6.3) 
between the T matrix and the scattering amplitude / p (^') we arrive at 
the simple yet fundamental formula 

(4.99) *(p - W) = | / p (a/)| 2 , a/ = (6\ f). 

A notable feature of the differential cross-section formula (4.98) is 
that it involves only T-matrix values <p' | T | p) for which | p' | = | p |. 
From the physical point of view this fact is taken to reflect energy 
conservation. Upon introducing in (R 6 the surface of points (p', p) for 
which p' 2 /2 m = p 2 /2m (which in physical literature is called the energy 
shell), it can be said that (4.98) involves only on-shell values of the T 
matrix. 

4.8. The Off-Shell T Matrix and S’ Operators 

The obvious approach to computing the T matrix for a given potential 
V(r) is to solve (e.g., by the methods discussed in §6) the Lippmann- 
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Schwinger equation (4.67) for the distorted plane wave $ k +) (r) and then 
insert the result into (4.89). If, however, the full Green function 
G( r, r'; f) is known, then the solution-type Lippmann-Schwinger 
equation (4.74) can be used to obtain directly the desired solution 
from (4.89): 

(4.100) <k' | T | k> = Urn (4> k , | V{\ + G[(k 2 /2m) + ie]V}\ & k ) 

= f drVZir) V(r) 

J 05 3 

x K(r) + f G<+)( r, r'; k 2 /2 m) V(r') 0 k (r') dr'] . 

L J 

Equation (4.100) suggests the introduction of the following *^(£) 
operators * 

(4.101) ^(£) = V + VG(£)V, £ g C 1 - S", 

which consitute a family of in general unbounded operators in L 2 (U 3 ) 
when £ varies over the resolvent set of the total Hamiltonian H. 

When the free Green's operator is applied to equation (4.101) from 
the left we obtain 

(4.102) G 0 (£)^(£) = [G 0 (£) + G 0 (£) FG(Q]F = G(£)F, 

where the last expression follows from the second resolvent equation 
(3.61) recast in terms of Green's operators. We substitute this expression 
for G(QV into (4.101) and arrive at the relation 

(4.103) nQ = V+ VG a (0 ^(0, £ g C 1 - S* 

known as the Lippmann-Schwinger equation for «^(£). 

Although the ^”(£) operators are well defined inL 2 (IR 3 ), their primary 
importance stems from their close relation to the so-called off-shell T 
matrix 

(4.104) <p' | | p> = (<*V | V + VG(QV \ <2> p ), 

which, generally speaking, is a function of the real variables p, p' g R 3 , 
and the complex variable £ g C 1 — S H . The interest of this quantity 


# It is the custom in physical literature to refer to (4.101) as “the T operator,” albeit 
one is actually dealing with an operator-valued function. To avoid notational confusion, 
authors who follow this custom (see, e.g., Taylor [1972], Amrein et al. [1977]) instead 
of dealing with the transition operator (3.50) work with the “/^-operator” R = —2mT. 
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lies in the possibility of deriving from (4.103) the integral equation (see 
Exercise 4.6) 

(4.105) <p' | n0\ P> = <P' I V | p> + J <P" I ^(01 P> d P" 

where <p' | V | p) is proportional to the Fourier-Plancherel transform 
F(p-p') of V(r): 

(4.106) 

<p' I V I p> = (<£>„' \V\%)= (2tt)- 3 l.i.m. f V(r) exp[*(p - p>] dr. 

J R 3 

Thus, a strategy often followed in computing the on-shell T matrix 
<p' I T | p) is to solve (4.105) for the off-shell T-matrix (4.104), then set 
| p | = | p' | and £ = (p 2 /2m) + ie in that solution, and thus finally 
obtain the desired quantity by letting e -> +0. This procedure is 
symbolized by the standard formal expression for the S matrix, 

(4.107) 

<p' I s i P > = «V - P) - »(■£—£) <■>' k (4- + i0 ) I *>) 

which, in fact, simply reflects the aforementioned computation of the 
on-^hell T matrix 

(4.108) <p'mp> = Jsa (p' \^(-^r + ie ) | p) - 

the insertion of the outcome into (4.94), and finally the computation 
of the transition amplitude 

(4.109) Q | Sfy = <g\f>- 2m(g \ Tf\ 

using in the process the notationally expedient device of writing 

<•?!/>= f £*(p)/(pHP = f dp'g*{p') f dp 8 8 (p' — p)/(p). 

% '|R 3 

Although easily avoidable as well as, strictly speaking, mathematically 
unjustified when used as before in conjunction with functions /(p), 
g(p) eL( 2 )(1R 3 ) (which, in general, might display discontinuities or even 
singular behavior on sets of Lebesgue measure zero), the ^-“function” 
notation has achieved general acceptance in physical literature. One 
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other of the many illustrations of its usage can be found in the generalized 
optical theorem , which states that for | p | = | p' |, 

(4.H0) < P 'm p>* - <p' m P > 

- 2 ” J„ <■>■ i r i »■>* 8 (-£--©«■• i r i ■» 

Without the use of S-symbolism, (4.110) can be written in terms 
of the scattering amplitude (4.20) as follows, 

(4.111) / p (o/) -/ p *(o/) = ( ip/ln ) [ /£(o/)/ p ( to") dt>", | p' | = | p |, 

J Q S 

where to' = (O', <£') are angular spherical coordinates of p' — pio'. The 
relation (4.111) is a reflection of the unitarity of the S operator, as can 
be easily established (see Exercise 4.7) by observing that 

= 1 - 2 7ri(T - 7 1 *) + 4v 2 T*T = (SS*)*, 

and therefore that S*S = SS * = 1 if and only if 

(4.112) T*T = (7 1 * - T)l2m = TT *. 

By setting p = p' in (4.111) we obtain the optical theorem 

(4.113) Im f p (to) = (pI4tt) f cj(p -> to") dto" = (p/4tt) g(E) 

that relates the scattering amplitude f p<a j(cjo Q ) in the forward direction 
o) 0 to the total cross-section o(E) at energy E = p 2 /2m. 

Finally, it is interesting to note that if we use the solution-type 
Lippmann-Schwinger equation (4.74) in (4.90) and take (4.78) and 
(4.108) into consideration, we can then directly relate the transition 
operator T to the ^(Q operators by means of a double spectral integral, 

(4.114) T = w-lim w-lim 8 £ (A - /x) ^(A + i V ) 

by applying Lemma 4.1 to express the k and k' integrations of (4.90) 
in terms of integrations with respect to the spectral function of H 0 . 

**4.9. Appendix: Scattering Theory for Long-Range 
Potentials 

We have mentioned in §2.7 that the standard time-dependent theory 
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developed in §§1-2 does not apply to long-range interactions correspond¬ 
ing in general to potentials of the form* 

(4.115) V v (r) = V 0 (r) + yr-, * < a < 1, 

where V 0 (r) is some short-range potential of the kind to which the 
standard theory is applicable. The same remark is, of course, valid 
for the stationary theory. 

When we have V 0 = 0, a = 1, and y = e x e 2 , we are dealing with 
the important case of the Coulomb interaction between two charges 
e x and e 2 . Hence it is of both physical and mathematical interest to 
learn how the standard theory should be modified in order to cope 
with long-range cases. + 

The rigorous time-dependent scattering theory for long-range 
potential is based on the observation (first made by Dollard [1964] for 
the Coulomb case) that the modified (or renormalized) wave operators 

(4.116) = s-lim exip(iHt) exip{—i[H 0 t + Dft\ H 0 )]} 

co 

exist for the following (asymptotically unique) choices of operator¬ 
valued functions of H 0 and t - 0 

l±y(ml2H 0 y!*ln(4tH 0 ), a = 1, 

(4.117) Z) a (±t,H 0 )= ym?IV- , . 

( ± (l-a)(2H 0 ^* T<a<1 - 

The most mathematically interesting feature of these operators is that 
they intertwine H 0 in (4.14) and H = H 0 + V v given in (4.115), so 
that (2.3)-(2.5) are satisfied if Q ± is replaced by S% ± , and perturbation 
theory can then be pursued in the ordinary manner. Physically, however, 
this fact does not suffice, but it turns out* that expectation values for 

* The case of 0 < a < \ requires iterative techniques involving derivatives of V Y (r )— 
see Buslaev and Matveev [1970]. 

+ See Amrein [1974] for a review of early rigorous results. For some more recent 
results see, e.g., Lavine [1973], Klein and Zinnes [1973], Gibson and Chandler [1974], 
Cattapan et al. [1975], Ikebe [1975], Narnhofer [1975], Semon and Taylor [1975, 1976], 
Gersten [1976], Hormander [1976], Masson and Prugovecki [1976], Rejto [1976], Saenz 
and Zachary [1976], Zachary [1976], Zorbas [1976a,b, 1977], Alsholm [1977], Kitada 
[1977], Saito [1977], Alt et al. [1978], Lapicki and Losonsky [1979], and Rosenberg 
[1979]. 

* See Manoukian and Prugovecki [1971] and Prugovecki [1971, 1976] for proofs 
concerning position, momentum, and angular momentum observables, and Amrein 
et al. [1970], Corbett [1970], and Zachary [1972] for proofs based on the “algebra of 
observables” approach. 
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the modified asymptotic states 

(4.118) ^ ex (0 = « P {»Wo* + DJf; H n )]} ^ ex ( 0 ) 

approach in the limit f —► ±oo those of truly free asymptotic states 
of m 


= e iH ° tl I /ex (0), '^(O) = ^ ex ( 0 ) = At^O), 

so that the physical asymptotic conditions (2.46) and (2.47) are satisfied 
by these free states W ex (t). This in turn implies that all basic physical 
relationships that were studied in §1 can be recovered in the context 
of long-range interactions. 

The oscillatory behavior of exp[z*Z> a (£; i/ 0 )] as a function of t and 
the existence of the strong limit in (4.116) implies that in the long- 
range case 

(4.119) w-lim e iHt e~ iHot = 0, 

t-*T oo 

so that the Moller wave operators (2.1) have trivial initial domains 
M± = {0}. This in turn implies that (see Prugovecki and Zorbas 
[1973a, Section 2]), 

(4.120) V* = w-lim f +C ° (A - H 0 + irfi- 1 V v d A E?'P ± 

v -*±0 J _ 00 

so that the Lippmann-Schwinger equations (3.34) are not satisfied. 
It is, of course, possible to reproduce the considerations of §§3.1-3.2 
adopting (4.116) as a starting point (see Prugovecki [1971b], Chandler 
and Gibson [1974]), and thus arrive at a modified version of (3.34) 
corresponding to a replacement of the integrand of the spectral integral 
in (3.33) by 

exp[i(H 0 — A i I V I)* + #<>)] dt ~ ^ 

0 

but from a computational point of view one runs into difficulties with 
explicitly taking the limit 17 —> ± 0 , due to the presence of oscillatory 
terms generated by exp[z*Z) a (£; H 0 )]* 

One way of eliminating these oscillatory terms (characteristic of 
long-range problems) is to introduce an asymptotically compensating 
operator Z (see Matveev and Skriganov [1972], Prugovecki and Zorbas 
[1973a, b]) in terms of which we can rewrite (4.116) as follows: 

- s-lim e iHt Z^~ iHQt W. 

t >T-oo 


(4.121) 
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Formally, we are then faced with the same kind of expressions as in the 
two-Hilbert space formulations of multichannel scattering theory 
described in §8.6, the difference being that the embedding operator J 
has to be replaced by the asymptotically compensating operator Z. 
Indeed, one can show (Prugovecki and Zorbas [1973a, b]) that the 
equations (8.71) with Z replacing J hold, and that they can be solved by 
iterative methods (Masson and Prugovecki [1976]). 

The preceding outline indicates that two-body scattering theory for 
long-range potentials can be brought in line with the short-range case. 
However, these results are actually of little practical interest, since due 
to screening effects (such as those for the Coulomb potential of a nucleus 
by the electron cloud around it) one can always replace y in (4.115) 
by a screening (or space cutoff) function y(r) that equals y on some 
neighborhood of the origin, but rapidly decreases to zero as | r | —> oo. 
Of course, we can incorporate those particles that give rise to screening 
into the problem, but then one is dealing with a many-body problem, 
so that the phenomenological approach based on a screening function 
presents great computational advantages. Furthermore, it can be shown 
rigorously (see Prugovecki and Zorbas [1973b, Section 7], Zorbas 
[1974]) that in its basic aspects the theory with a space cutoff merges 
into the exact theory as the cutoff is removed. 

A fortuitous aspect of two-body scattering for Coulomb potentials 
is that its eigenfunctions can be computed explicitly by solving (4.17) 
for V(r) = yr _1 (see, e.g., Messiah [1962, Chapter IX, §§7-11]), and 
the addition in (4.115) of the short-range potential V 0 (r) can be then 
treated as a perturbation of this pure Coulomb problem. Consequently, 
the main thrust of most of the work on scattering with long-range 
forces quoted in this section has been in the direction of multichannel 
scattering, the two-body case providing mainly a testing ground of the 
proposed ideas. 

**4.10. Appendix: Eigenfunctions and Transition Density 
Matrices in Statistical Mechanics 

The general concepts and results of §§4.4-4.7 can be adapted with 
ease to Liouville space. In fact, a Liouville space jSf(^) is itself a 
Hilbert space, and therefore from a purely mathematical point of view 
one might simply consider such constructs as eigenfunction expansions 
and T matrices directly in without reference to Jfl However, 

such an oversimplified approach would ignore the basic differences 
between the formulas (1.14) and (8.69) in Chapter IV, which reflect a 
difference in the physical interpretation in JS f(J^) as opposed to Jf! 
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Indeed, whereas for a wave packet iff(x) it is | iff(x) | 2 that is interpreted 
as a probability density, for a density matrix p(x', x") = <#' | p | x ") 
it is its “diagonal” values p(x , #), and not | p(x\ x") | 2 , that play an 
analogous role. Consequently, in a physically correct approach to 
we cannot altogether ignore 

For example, in treating eigenfunctions related to H 0 = # _1 [i7 0 , •] 
[as defined by (8.51) in Chapter IV] we shall use as a starting point an 
eigenfunction expansion 0 y (x ) for a complete set of observables re¬ 
presented by operators Y x Y n in and satisfying (4.45)-(4.48). 
The functions 

( 4 . 122 ) *T) = <*$(*') 0y< x ") 

then provide an eigenfunction expansion for the complete set of super¬ 
operators [Yy, •] and {Yy, •}, j = 1,..., n, but these superoperators 
do not always possess a physical interpretation, although Y x ,..., Y n 
might. On the other hand, we have 

( 4 . 123 ) Ho ,x "&y' t y"(x', X ') = h~'[A{y') - A(y")] x"), 

so that the concepts and results of §§4.4 and 4.5 can be transferred 
with ease to H 0 and x"). 

As mentioned at the end of §3.8, the methods of quantum scattering 
theory can be adapted to classical statistical mechanics for the study 
of transport phenomena. In particular, it has proven advantageous 
to study classical differential cross sections within a superoperator 
formalism inspired by the quantum approach. * In fact, through the use 
of the master Liouville space described in §8.9 of Chapter IV, it is 
possible to provide a unified treatment of both classical and quantum 
differential cross sections. + 

The idea is to start with a density operator describing a state in 
which the incoming reduced particle is to be found within a finite 
cylinder of base B and length l (i.e., within a finite beam), its momentum 
probability distribution being a S-like function S e 3 (p — p 0 ) peaked 
around p 0 . Proceeding very much as in §1.8, one then uses the T 
superoperator formula (3.89) to express the probability of that particle 
scattering in a potential (centered at the origin) within a cone C (+) , 
which is then treated as in (1.40) by letting C (+) become infinitely 

# See, e.g., Miles and Dahler [1970], Coombe et al. [1975], and Eu [1975]. 

1 This treatment has been espoused in Sections 3 and 4 of Prugove£ki [1978c], where 
proofs of the subsequently outlined results can be found. 
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narrow. The key difference between the treatment of §1.8 and the 
present case emerges, however, as one makes the transition to an infinite 
beam [by letting in the present case B and l go to infinity, so that 
S e 3 (p — — p 0 )]. Indeed, whereas in §1.8 prior to taking this 

limit it had proven to be mandatory* to carry out an averaging over an 
infinite plane (reflecting a planar distribution of particles in the target), 
that is not at all the case in the present context. As a matter of fact, 
since the present derivation is applicable not only to the quantum but 
also the classical case, any such averaging would be mathematically ad 
hoc, since the well-known formula (see, e.g., Balescu [1975]) 

(4.124) cj cl (p -> a)') = (b db dfy/dio' 

for the classical differential cross section in terms of the impact parameter 
b can be derived without any kind of averaging. Furthermore, in the 
context of statistical mechanics, any such averaging would be also 
physically unjustified since in such realms of applicability as collisions 
within gases obviously no planar pattern can be claimed to exist. 

The resulting general formula for the (point-target) differential 
cross section is 

(4.125) °s(p - «') = JT dpp »J du$U*)(™lu)<u | T I p> s 
where the transition density supermatrix 

(4.126) <u | T | p> s = -i(2irf(*& | H< s) | ^) 

makes its appearance. The classical cross section (notationally represented 
by s = cl) formally corresponds to x (cl) taken to be a S-function, and it 
indeed turns out to equal (4.124) upon adopting for Hj cl) the interaction 
term in equation (8.74) of Chapter IV specialized to the case of a single 
particle. In the quantum case x (s) is given by (8.80) and (8.86) in Chapter 
IV, and for very precise momentum measurements (i.e., very large 
values of s) <t s in (4.125) appears to be approximately equal to a in 
(4.98) under realistic assumptions about relative orders of magnitude 
of the basic parameters appearing in a scattering experiment (see 
Prugovecki [1978c, Section 4]), but no exact equality has been established. 


* This averaging procedure is common to all rigorous derivations of the formula 
a(p —► a>) = l/p(^)| 2 , and is not peculiar to the specific derivation in §1.8 (see, e.g., 
Messiah [1961, Chapter X, § §5 and 6], Taylor [1972, Section 3-e] and Newton [1979]). 
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Exercises 

4*1♦ Show that the expressions on the right-hand sides of (4.38) 
and (4.41) are identical. 

4*2* Show that if G ( }\x , x'; £) and G ( j\x % x'\ £) are two Green func¬ 
tions of A and //, is o finite, then G i } ) (x 9 x'\ £) = G i ^ ) (x 9 x'\ £) almost 
everywhere in x and x' (with respect to the measure jjl X p). 

4*3* In potential scattering G(r, r'; A ± ie) and G (±) (r, r'; A) are 
given by (5.6) and (5.17), respectively. Prove that if V(r) is square 
integrable and | V(r)\ = O(l/r 2+eo ) for some e 0 > 0, and if | ^ ±) ( r )l ^ 
M b { r) for all k e B, where M B { r) is integrable on U 3 for any compact 
set B disjoint from the exceptional set mentioned in §4.2, then 
provided J | r — r' | _1 V(r ,v ) dr ' < const (cf. Excercise 6.2), 

lim f <Ao*(r) G 0 (r, r'; A + i v ) V(r') #£V) ftk) dr dr' dk 
= f A*(r) G< ±} (r, r'; A) F(r') <P^(r) $(k) dr dr' dk 

for iff 0 (r) e ^(1 R 3 ) and $( k) e ^(1 R 3 ) with supp ^ n 5*^ = 0 . 

4*4* State conditions under which you can derive Lippmann-Schwinger 
equations from ( 3 . 34 ) for arbitrary eigenfunction expansions: 

$>«(*) = 0 y (x) + f Gl ±) (x, x'; A{y)) V{x) $f\x') d^x'). 

J u m 

4*5* Show that if the integration on the right-hand side of (4.84) is 
restricted to A~\(a , 6 ]), the outcome equals the limit in (4.86). 

4*6* Prove that if V(r) is square integrable on [R 3 , then 

<k I ^(01 k'> = (0 k | [i + VG(0] V0 k >y 

exists [in the sense of (4.81) and Definition 4.2] and that (4.105) is satis¬ 
fied almost everywhere in (p', p) e [R 6 . 

Note . If V(r) is also integrable, then its Fourier transform exists 
and is continuous, and therefore (4.105) holds whenever <k I f(Q | k'> 
is continuous in k, k' e R 3 . 

4.7. Show that (4.112) implies (4.111) if (6.62) holds true. 
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5* Green Functions in Potential Scattering 
5.1. The Free Green Function 

Potential quantum scattering is the special case of quantum scattering 
theory in which the underlying formalism is that of wave mechanics, 
and the interaction is determined by a potential. Proceeding as in §§4.7-9, 
in applying the general methods of scattering theory to potential scatter¬ 
ing of two particles we shall limit ourselves to systems of two spinless, 
nonidentical particles interacting via a potential V(r), where r = 
r 2 — r x , and r x , r 2 are the position vectors of the two particles. This 
case exhibits all the essential features of all other cases. For example, 
the case of nonzero-spin particles can be treated essentially by the same 
methods as the zero-spin case when group-theoretical methods are 
applied. 

In the case of systems of two identical particles one must also take 
into account the presence of the Fermi-Dirac or Bose-Einstein statistics. 
From the practical point of view this means that in these last two cases 
the calculations have to be carried out in the respective subspaces of 
antisymmetric or symmetric functions of L 2 (R 3 ), rather than in the space 
L 2 (R 3 ) itself.* Upon removal of the center-of-mass motion (see §1.7), 
the resulting subspaces of L 2 (U 3 ) containing the states of the reduced 
particle consist of wave functions of even or odd parity, respectively. 

In the absence of external fields and mutual interaction, the relative 
motion Hamiltonian H 0 of a system of two spinless particles is the 
kinetic energy operator in the center of mass system that can be viewed 
also as the kinetic energy operator for a particle of reduced mass m 
[see (5.93) in Chapter II]. We recall from Theorem 7.3 of Chapter IV 
that H 0 = T s is a self-adjoint operator for which 

(5.1) (H 0 tf>)(r) = A^(r), 0 e S 0 , 

and that H 0 is the only self-adjoint operator having this property. 
On wave functions $(k) e U F @ T CL 2 (U 3 ) in the momentum space, 
H 0 acts in the following manner: 

(5.2) (//«<A)~(k) = ^ *A(k), kV(k) e L (2) (R3). 


* See Rys [1965] and Taylor [1972, Chapter 22] for a general treatment of the scattering 
theory of two identical particles. Naturally, for particles of nonzero spin one has to work 
on direct sums of L 2 (R 3 ) spaces [see (7.9) in Chapter IV]. 
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The above operator is obviously self-adjoint and, since it is a function 
of the momentum operators P {x \ P iy) , P iz) f it immediately follows that 
it has a pure continuous spectrum 

(5.3) S* = Sf<> = [0, H-oo). 

Thus, its resolvent 

(5-4) 

is defined for all values of £ e C 1 . Moreover, U 0 (£) * s a bounded linear 
operator defined on the entire Hilbert space Jf 7 = L 2 (IR 3 ) whenever 
Re £ < 0 or Im £ ^ 0, i.e., when the argument arg £ of £ is within the 
open interval (0, 2tt) and for such values in scattering theory —i? 0 (£) 
called the free Green's operator . 

Theorem 5*1* The Green’s operator G 0 (£) of the free Hamiltonian 
(5.2) is an integral operator 

(5.5) (G o (£)0)(r) = -(R o m(r) = f G 0 (r, r'; £) 0(r') dr' 

4 3 

for values of £ restricted to the range 0 < arg £ < 277. The kernel 
G 0 (r, r'; £), called the free Green function, is 

(5.6) G 0 (r, r'; 0 = 2?r| ~™ f> | exp[i v^TI r - r' |], 
where \/2m£ is the square root for which Im y/2 m£ > 0. 

Proof. According to (5.2) and Theorems 2.5-2.1 in Chapter IV, 

(5.7) (*o(OlA)~(k) = W 

Assume that the wave function */r(r) in the configuration space belongs 
to ^(U z ). Then ifs(r) and all its derivatives have Fourier transforms 

(5.8) iwmr*) = (2»)-» /„«-«' m *'• 

which by Theorem 4.5 of Chapter III are continuous and bounded, and 
therefore j/f(k) e ikT is integrable* on U z . Since (k 2 /2m — £) _1 is bounded 

# Throughout this section, whenever we deal with integration, we have in mind 
integration with respect to the Lebesgue measure, except if otherwise explicitly stated. 



522 


V* Quantum Mechanical Scattering Theory 


in k over [R 3 when 0 < arg £ < In, the inverse Fourier transform of 

(k 2 /2m - £)~V(k) 

exists. By virtue of (5.7), (5.8), and Lebesgue’s dominated convergence 
theorem, 

(5.9) WOSXr) - ^ Hm (■£- - t)“ «» «✓). 

Due to the fact that i/r(r') is of compact support, the function 

Id-fe^-o] m 

is Lebesgue integrable on IR 3 X {k: | k | < i?} with respect to the 
variables r and k. Hence, we can apply Fubini’s theorem to interchange 
the order of integration in (5.9): 

( 5 . 10 ) (R o m(r) 

= T^sr f dr'ifj(r') lim f dk (-^-f) exp|>*k(r — r')]. 

(2tt) 3 J K 3 7 r-»co J, k |<K V 2m J v 





k 2 /2m - C 


To carry out the above integration in k, let us write (for fixed 
p = r — r') the variable k in spherical coordinates introduced in such 
a manner that 

k(r — r') = kp = kp cos 0. 

It is very easy to carry out immediately the integrations in 0 < 9 < tt 
and 0 < </> < 2 tt\ 


< M1) 


{*R r 71 r^TT / A2 \— 1 

= j dk j dd j d<f,k* sin e (-—- - 

. r R k sin kp . . r +R & sin kp 

- -*• J. rfp;*. - o * = J 


: k 2 - 2m£ 


dk . 


In the last of the above integrals we have to integrate with respect 
to k g [— R f +/?] the function 


k sin kp k e ikp k e~ ikp 

& - 2mi ~ 2i(k - V2mQ(k + Vl m£) _ 2i(k - V2mQ(k + Vlmt) 


(5.12) 
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and then let R—> oo. If we work with the complex variable k e C 1 , 
the above functions are analytic everywhere in the complex plane, 
except for two simple poles at + \/2 m \/£, Im a/£ > 0. Let us close 
the contour of integration by semicircles in the upper-half and lower-half 
complex plane, respectively, when integrating on the real line the first 
and second function on the right-hand side of (5.12). Since the con¬ 
tributions from the respective integrations on the semicircles vanish in 
the limit of their radii R becoming infinite, we easily determine, by 
using Cauchy’s integral formula and computing the respective residua: 


.. f +/? &sin&/o „ If ke ikp „ If k e~ ikp „ 

R™ k 2 — 2 ml 2 i J k 2 — 2ml, 2i J k 2 — 2ml, 


k e ikp 


+ 77 - 


k e~ ikp 


k + a/ 2^T Lvssc k ~ 


k——V2mC 


= _ ^ipVZmt 


Inserting this result in (5.11), we get from (5.10) 


(5.13) CRo(0</.)(r) = ~ j f + f | exp(« \/2m'Q r' - r |) 0(r') dr' 


for all ifs e ^(R 3 ). Since ^°(R 3 ) is dense in L 2 (R 3 ) (see Chapter II, 
Theorem 5.6), while both R 0 (Q and the integral operator with the 
kernel (m/27 t)\ r — r' | _1 exp(iV2m£ | r — r' |) are bounded linear oper¬ 
ators defined on the entire Hilbert space L 2 (R 3 ), we infer with the help 
of the extension principle (Chapter III, Theorem 2.6) that (5.13) is true 
for all p eL 2 (R 3 ). Q.E.D. 

It follows from (5.6) that the free Green function has the following 
symmetry properties: 

(5.14) G 0 (r, r'; Q = G 0 *(r, r'; £*), 

(5.15) G 0 (r, r'; £) = G 0 (r', r; £) 

for any £ e C 1 not on the positive real axis. 

We note that considered as a function on R 6 , G 0 (r, r'; £) is continuous 
and infinitely differentiable everywhere except on the set 

(5.16) D Gq = {(r, r'): | r —- r' | = 0}, 


which is of Lebesgue measure zero on R 6 (see Exercise 5.1). Further- 
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more, 

(5.17) G^r, r'; k 2 j2m) = lim G 0 [r, r'; ( k 2 /2m ) ± ie\ 

= [—w/(2tt | r — r' |)] exp(±*& | r — r' |) 

also exists pointwise for any k ^ 0 and all (r, r') e IR 6 — D Gq , and by 
Lebesgue’s dominated convergence theorem (4.66) is certainly satisfied 
in the present context for all /(r), g(r) e ^^((R 3 ). Consequently the 
retarded and advanced free Green functions Go +) (r, r'; A) and Go _) (r, r'; A) 
do exist for all A e Sf° = [0, +oo). These functions inherit the sym¬ 
metry property (5.14), and we have 

(5.18) G«(r, r'; A) = G< fc) (r'- r; A) = Gf >*(r, r'; A, 
for all A ^ 0, as is easily verified by the use of (5.17). 

5.2. Partial Wave Free Green Functions 

As we have shown in §4.6, the free Green function G 0 (r, r'; £) is a 
key ingredient of the Lippmann-Schwinger equations (4.67) used in 
computing distorted plane waves In such computations one 

implicitly works with the complete sets of observables Q and P, as 
reflected by the fact that the plane waves <Z> k (r) are functions of the 
position variables r, and are parametrized in terms of the momentum 
variables k. 

We shall see in §7.4 that when the potential is central (i.e., spherically 
symmetric) it can be advantageous to work with the complete sets 
{| Q |, 0 q , d> Q } and {| P |, | L |, L {z) } instead. It is obvious that H 0 
commutes with | P |, | L |, and L {z \ and in fact, in accordance with 
(4.39), 

//„ = 4,(|PUL|,L ( ‘>) = |P| 2 /2 m. 

Since H 0 has no point spectrum, by (4.42) and (4.63), 

C„(,, o = Urn. ^ Y-(^ < ) r ' > *•<*• '■ ”>■ 

Using spherical coordinates (r, 6 , cf>) for r, and taking into consideration 
(4.26) and (4.40), we arrive at the following result. 

Theorem 5*2* The free Green function equals a sum (convergent 
in the mean), 

(5.20) G 0 (r, r'; £) = | £ G 0 \r, r'; 0 Yr(9, <f>) f )*, 

1=0 m=-i 
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of the partial wave free Green functions 


(5.21) 



Ji(kr)ji(kr') 
l — (£ 2 /2 m ) 


k*dk 


( —ImiKj^xr) h^\ K r'), 
\ —ImiKj^Kr') h^\icr) 9 


r < r' 
r > r\ 


where k = (2mQ 1 / 2 , Im k > 0. 

Upon noting that the integrand in (5.21) is an even function, the 
computation of the previous explicit expression for G 0 *(r, r'; £) proceeds 
by contour integration and the use of the spherical Hankel functions 
h ( i j2) (kr). Indeed (see, e.g., Butkov [1968]), as | k | —> oo in the complex 
plane, 

Hkr') = Wifikr') + h^\kr')} 

~ (2ikr')-'{z*v{i[kr' - |(M1} - exp{-^r' - i(h)]}, 


so that upon inserting this result in the integral in (5.21) we see that 
for r < r\ ji(kr)h { ^\kr') and h^ikr^^kr') converge to zero as | k | —> oo 
in the upper-half and lower-half complex plane, respectively. Hence, 
proceeding as we did from (5.12) to (5.13), we close the contour of 
integration appropriately, and using the residuum theorem arrive at the 
first of the explicit expressions in (5.21) upon noting that j x (—kr) = 
(—1 ) l ji(kr) and h\ 2) (—kr) = (—1 ) l h\ 1) (kr). For r > r' the roles of r 
and r ' in the above procedure are obviously reversed, and the second 
expression immediately follows. 

In (5.21) taking the limits Im £ —> ±0 for fixed Re £ ^ 0, we arrive at 


(5.22) Gj, ±)l (r, r'; k*/2m) 


Tlmikj^kr) k\ l) (±kr’), r < r\ 

^Imikj^kr') h { l 1) (±kr), r > r'. 


The two functions in (5.22) are obviously the advanced and retarded 
partial wave free Green functions . 


5.3. Fredholm Integral Equations with Hilbert-Schmidt 
Kernels 

One of the main significant features of the free Green function 
G 0 (r, r'; £) is that it provides an integral equation for the full Green 
function G(r, r'; £), corresponding to the Schroedinger operator H , 
for which 


(5.23) 


{m){r) = —(l/2m) d0(r) + V(r) 0(r), 0 e . 
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This integral equation is the second resolvent integral equation (4.76), 
which in the present case becomes 

(5.24) G(r, r'; £) = G 0 (r, r'; £) + f G 0 (r, r'; Q V(r") G(r", r'; Q dr". 

This equation can be solved by the Fredholm method if its kernel 

(5.25) K{ r, r n \ £) = G 0 (r, r"; £) V(r”) 
is of Hilbert-Schmidt type.* 

Before attacking the problem of solving (5.24), let us first review some 
basic facts about integral operators with Hilbert-Schmidt kernels. 

Definition 5*1* The kernel K(oc f a'), a, a' e (R w , of the integral 
operator K on L 2 (IR m ), 

(5.26) (Kifj)(oc) = f K(oc 9 a') 0(a') d™*', 0 eL 2 ((R™), 

J u m 

is a Hilbert-Schmidt kernel (or L 2 kernel) if | K(oc , a')| 2 is integrable on 
R 2w : 

(5.27) (K | K\ = f | K{* 9 a')l 2 dm <* dm * < +oo. 

J052m 

A Hilbert-Schmidt kernel which is continuous and such that K(oc , oc) 
is integrable on U m is called a trace-class kernel . 

The terminology for kernels of integral operators is related to the 
terminology for the corresponding operators in a straightforward 
fashion: integral operators with trace-class kernels are of trace class* 
with Tr K = J K{a , a ) d m a , and integral operators with Hilbert- 
Schmidt kernels are Hilbert-Schmidt operators (see Exercise 5.3). 

The Fredholm integral equation is 

(5.28) 0(a) =/(<*) + oj f K(ot, a') 0(a') d m ot', 

J R m 

where a> is, in general, some complex number. If we search for solutions 
0(a) which are Lebesgue square integrable, so that K(oc, a') is the kernel 
of an integral operator on L 2 (IR m ), and if /gL 2 ((R w ), then the integral 
equation (5.28) can be written in operator form: 

(5.29) 0 = / + <wK0. 

* A condensed and very lucid presentation of the Fredholm theory for Hilbert-Schmidt 
kernels (L 2 kernels) is given by Smithies [1965] in Sections 6.S-6.7. 
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Hence, (5.28) has a unique solution inL 2 (U m ) if and only if (1 — cdK )~ 1 
exists and if / is in the domain of definition of (1 — coK)- 1 . 

For co ^ 0, (1 — ojK)- 1 is related in a straightforward manner to the 
resolvent R K (£,) of the operator K: 

(1 - o>K)-' = — - kY 1 = --R k (—). 

If K(a, ol) is a Hilbert-Schmidt kernel, then K is a Hilbert-Schmidt 
operator, and as such, K has a pure point spectrum. Hence, (1 — uK)* 1 
will exist if and only if 1 /co 4 S*. When this is the case, the domain of 
definition of (1 — co.K’) -1 is the entire space L 2 (IR W ). 

The Fredholm theory essentially provides a means for the com¬ 
putation of (1 — coK) -1 from the kernel K{oc , a'). The main results of 
this theory for L 2 kernels are contained in Theorems 5.3 and 5.4. The 
reader interested in the proofs of these theorems is advised to consult 
Smithies [1965], Theorems 6.5.1-6.7.1. 


Theorem 53* Let K{oc , a') be a Hilbert-Schmidt kernel and let us 
introduce the modified Fredholm determinant of K(a, a) 


0 K (, a 2 ) - Kfa,*") 


(5.30) 



% 5 «i) 0 ••• A:(a 2 ,a n ) 

F-ip^n > a i) F(oc n , a 2 ) ••• 0 


d m <x ••• d m oc n , 


and the modified first Fredholm minor of K(a, a) 


(5.31) 


K(«, «') K(a, oj 
0 


K(<*, «») 


^(“l , “») 


d m o^ —d m <x n , 


I K{<*n , “') K{<x„ , o^) ••• 0 I 

for n = 1, 2,...; for n = 0, d 0 = 1 and D 0 ( a, a!) = i£(a, a'). In that 
case the series 


(5.32) 


oo 


71=0 


is convergent for all complex values of to. Likewise, for any fixed 


* A proof of this statement can be based on Mercer’s theorem (see Riesz and Sz. Nagy 
[1955, Section 98]), or it can be deduced from (8.10) in Chapter IV. 
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a> E C 1 , the series 

(5.33) Z)(a, a'; a>) = ]T Ai( a > a')^ n 

n —0 

is convergent in the mean [in L 2 ([R 2w )] for all o>. Moreover, the function 
D(a, o>) is a Hilbert-Schmidt kernel for any fixed complex o>. 

Theorem 5*4* If d(oj) ^ 0 and fEL 2 (U m ), the integral equation 
(5.28) has the unique Lebesgue square-integrable solution 

(5.34) 0(a) =/(«) + f D(oc, a'; <o)/(a') </™a'. 

d[oj) J R « 

Theorem 5*5* The function d(co) vanishes if and only if co*" 1 is an 
eigenvalue of the integral operator K . 

5.4. The Full Green Function 

We can apply the Fredholm theory to the integral equation (5.24) 
when its kernel (5.25) is of the Hilbert-Schmidt type. Under these 
circumstances, (5.24) is a special case of the integral equation (5.28), in 
which the variables a, a! e U m become r, r" e IR 3 , and o> = 1. More¬ 
over, we note that since in (5.6) £ is not on the positive real axis, 

(5.35) | G 0 (r, r'; Q\ = ^ f , exp(-Im VW | r - r' |) 

is square integrable in r or r' because according to Theorem 5.1 we 
have to choose the square root of £ for which Im \/£ >0. Hence, for 
any fixed r", (5.24) has a unique solution, provided that d( 1) ^ 0 for the 
given kernel (5.25). 

To establish that d( 1) is not zero for the kernel K{ r, r'; £) in (5.25), 
note that K( r, r'; £) is the kernel of the closed operator 

(5.36) = (t-H 0 )-'(H-H 0 ). 

According to Theorem 5.5, d{ 1) = 0 if and only if co = 1 is an eigen¬ 
value of this operator, i.e., if there is a nonzero 0 X for which 


<(£ - H 0 )* | = <(£ - HM | G o (£)i/ 1 0 1 > = I 0> 

for all iff e Sj h == Sj Hq C S) Hx . However, the above relation implies 
that Hifs 1 = £0 X . Thus, as long as £ does not belong to the spectrum 
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of H , the Green function G{ r, r'; £) of H exists, and can be computed by 
means of (5.34) with co = 1 if K is Hilbert-Schmidt. 

Let us find out now the restrictions imposed on the potential V(r) by 
the demand that K(r, r'; £) be square integrable. Applying Fubini’s 
Theorem, we find that 

(5.37) f | K(r, r'; £)l 2 dr dr' = f dr'\ F(r')|* f dr\ G 0 (r, r'; £)| 2 . 

J u •'r •' r 3 

Using the expression (5.6) for G 0 (r, r'; £) we easily compute that 

(5.38) Jr* I G °( r ’ r ’ * = ^ Vt" ImVt ’ In,Vf>0 ' 

Thus, we get the relation 

(5.39) J a- 1 K(t, r’; {)!•**' - f | 

which shows that G 0 (r, r',* £) V(r') is for Im J^Oa kernel of Hilbert- 
Schmidt type if and only if V(r) is square integrable on IR 3 . 

The main conclusions of the above discussion are contained in 
Theorem 5.6. 

Theorem 5*6* If the potential V(r) is square integrable on 
IR 3 , then the full Green function G(r, r'; f) of H [where HD — 
(ll2m)A + V(r)] exists for f £ S H and can be written in the form 

(5.40) G(r, r'; Q = G 0 (r, r'; Q + j D(r, r"; 1) G 0 ( r", r'; £) dr", 

where d( 1; £) and Z)(r, r"; 1) are obtained by taking 

(5.41) K(r,r'U) = G 0 (r 9 r';()VW 

as a kernel in (5.30)-(5.33). 

5.5. Fredholm Expansion of the Full Green Function 

The kernel D( r, r"; 1) of the integral operator in (5.40) is given 
by the infinite series 

(5.42) D( r, r"; £) = K( r, r"; Q + £ Z) n (r, r"; J), 



530 


V* Quantum Mechanical Scattering Theory 


in which by (5.31) 


D n (r, r"; Q 


K(r, r"; £) K( r, r i; .Q 


K(r, r„; 0 


( 5 . 43 ) 


(- 1 )” (• 
ti\ J 


K(r t , r'; 0 0 

0 K(r n ',t i: Q 


•K(ri, r„; Q 


dr x ••• dr n . 


0 


This would imply that the full Green function can be expanded in a 
series 

( 5 - 44 ) G(r, r'; Q = f G„(r, r'; £), 

n=0 

where G 0 (r, r'; £) is the free Green function, and 

( 5 . 45 ) G„(r, r'; £) = —f ^(r, r"; £) G 0 (r", r'; £) dr" 

4; •'r 3 

for n = l, 2 ,..., if the order of integration and summation can be 
interchanged. To prove that this is indeed the case, we need Lemma 5.1. 


Lemma 5*1 ♦ If K(oc, a) is a kernel of Hilbert-Schmidt type, then 
the Hilbert-Schmidt norm 

(5.46) II D n || 2 = | D n (oi, <x')| 2 d m ct d m «| 

of the modified first Fredholm minor D n {oc , a) satisfies the inequality* 

(5.47) || D n || 2 < ir-'V* || K H^+S n = 1, 2,..., 

and the modified Fredholm determinant d n satisfies the inequality 

(5.48) \d n \^{(eln)v*\\K\\ 2 }", n= 1,2,..., 

where || K\\ 2 is the Hilbert-Schmidt norm of the integral operator K f 
with the kernel K(oc , a'): 

(5.49) || XU, = jj 2 J K(a, a , )| 2 d*» a d*»o I 'J 1/2 . 

The proof of the above result can be found in the work of Smithies 
[1965, Section 6.5]. 
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Applying in (5.45) the Schwarz-Cauchy inequality at fixed r, r' e IR 3 
and using afterwards (5.47), we get by (5.38) 

(5.50) \d{\-Q?\ dr | G„ +1 (r, r'; £)| 2 

<f dr\\ | D n {r, r"; t)\* dr"]\ ( \ G 0 (r", r'; £)l 2 dr"] 

J \& 3 LJ [fS 3 J L%, 05 3 J 

<w _ n r^T m{e(K 1 K} 2 } n+1 
^ V 2 277 Im vT~ 

Due to n~ n this expression converges very rapidly to zero as n —► + oo, 
so that we can state the following theorem. 

Theorem 5*7* If V(r) is square integrable on IR 3 , then the full 

Green function G(r, r'; £), £ $ S H , is square integrable in r e R 3 for 

almost all r' e R 3 , and the series (5.44) converges in the mean with 
respect to the variable r e R 3 to G(r, r'; £) for all such r' e R 3 . 

5.6. Symmetry Properties of the Full Green Function 

The two symmetry properties (5.14) and (5.15) of the free Green 
function are inherited by the full Green function, i.e., for ^S H 

(5.51) G(r, r'; £) = G*(r, r'; £*), 

(5.52) G(r, r'; £) = G(r', r; £) 

holds almost everywhere in R 6 . 

The first of these two relations is a straightforward consequence of the 
formulas (5.42) and (5.44) for G(r, r'; £), resulting from the fact that 
V(r') is real, and consequently 

K{ r, r'; £) = G 0 (r, r'; £) V(r') = G 0 *(r, r'; £*) V(r') = K*( r, r'; £*). 

To prove (5.52) we have to resort to a lengthier argument. First of all, 
we have to note that the function 

A(r, r'; £) = f D(r, r"; 1) G 0 (r", r'; £) dr" 

J R 3 

is the kernel of a Hilbert-Schmidt operator. This follows from the fact 
that G 0 (r", r'; £) is the kernel of the bounded operator (£ — i/ 0 ) _1 , 
while D{ r, r"; 1) is a kernel of the Hilbert-Schmidt type, since by 
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(5.47) and Theorem 5.3 it is the limit in the mean of the Hilbert- 
Schmidt kernels (see Exercise 5.4) 

£ D„(r,r';l), AT =1,2,... 

n=0 


Consequently, D{ r, r"; 1) is the kernel of a Hilbert-Schmidt operator 
Z)(l). Thus, S x (r, r'; £) is the kernel of the operator Z)(l)(£ — i/ 0 ) _1 > 
which is the product of a Hilbert-Schmidt operator and a bounded 
operator, and therefore is of Hilbert-Schmidt type (see Exercise 5.5). 

The function f*(r)g(r f ) is square integrable in IR 6 whenever 
/, g eL 2 (R 3 ). Since /^(r, r'; f) is a kernel of Hilbert-Schmidt type, it is 
also square integrable in IR 6 . Hence, the function/*(r) J\(r, r'; l)g(r') 
is integrable in IR 6 . Consequently, Fubini’s theorem can be applied 
to infer that 


C drf*(r) f dr' J\(r, r'; t)g(r') 

J J nr 

= f dr' g(r') I" dr /^(r, r'; Qf*(r). 

It can be seen from (5.6) that a corresponding relation holds for 
G 0 (r, r'; £). Consequently, in view of (5.40), we have 

(5.53) f drf *(r) f dr' G( r, r'; Qg(r') 

J M 3 J U 3 

= f dr' g(r') f drG(r,r';Qf*(r). 

On the other hand, the relation 

(f 1 H-£ gS ) = ( H - £* ^ k)’ ^ S * 

in conjunction with (5.51) implies that 

(5.54) f drf *(r) f dr' G(r, r'; Og(r') 

= f dr' g(r’) f dr G*(r’, r; £*)/*(r) 

•'or J or 

= f dr'g(r') f dr G(r', r; Qf*(r). 

•> DT J 05 3 
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Comparing (5.53) and (5.54), we see that 

f 3 dv ' g( r ') f drG(r, r';£)/*(r') 

= f dr'g(r’)j drG(r',r;0f*(r) 

J R 3 •'M 3 

for all square-integrable functions £(r'). This is possible if and only if 
f G(r, r'; £)/*(r) dr = f , G(r', r; £)/*(r) dr 

J R 3 •'R 3 

for almost all r' e R 3 . Since the above equality holds for all square- 
integrable functions/(r), we infer that (5.52) holds almost everywhere 
in R 3 X R 8 = R 6 . 

This completes the proof of (5.52). 

*5.7. Appendix: The Spectrum of the Schroedinger Operator 

We have seen already that if K(r , r'; £) is a Hilbert-Schmidt kernel 
the full Green function exists as long as £ £ S H . This is certainly the 
case if Im £ ^ 0. The following theorem shows that this will be also 
the case if £ is real but negative, with the possible exception of a finite 
or at most countably infinite number of values. 

Theorem 5*8* Suppose the potential V(r) is locally square 
integrable and V(r) 0 as r —> oo. Then the continuous spectrum 
of the Schroedinger operator H'D —(l/2m)A + V(r) contains no 
negative values; moreover, there are only at most countably many 
negative eigenvalues, all of finite multiplicity, * and having no negative 
number as an accumulation point. 

In order to prove the above theorem, we need a number of auxiliary 
results. Since these results are of interest in themselves, we shall present 
them as lemmas and theorems. 

We recall from §3 that if H and H 0 are any two operators (not 
necessarily self-adjoint) with identical domains of definition and £ 
belongs to their resolvent sets, then the resolvent 7?(£) of H can be 
expressed in terms of H x = H — H 0 and of the resolvent 7? 0 (0 Ho 
by making use of the infinite series in (3.68), provided that this series 
converges in some sense. Lemma 5.2 represents, to some extent, a 
converse to this result, valid under the additional assumption that 

# Recall that the multiplicity of an eigenvalue is the number of linearly independent 
eigenvectors corresponding to that eigenvalue. Local (square) integrability means (square) 
integrability over arbitrary bounded Borel sets. 
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H X R 0 (£) has a bound less than one; in this context, it should be noted 
that when 2 H == 2 U , the operator is defined everywhere on 

since m) is defined everywhere on and it maps any f e 3#* 
into /? 0 (£)/ e @h 0 — ®h x > so that Hi-R 0 (0 f is defined. 

Lemma 5*2* Suppose that H and H 0 are two operators (not 
necessarily self-adjoint) with identical domains of definition 3f K = Z& Ho > 
and that the complex number £ belongs to the resolvent set of H 0 . If 
|| J/ 1 /? 0 (£)|| <1, H 1 = H — H 0 , then £ belongs also to the resolvent 
set of H , and 

(5.55) *(£) = *o(£) u-Jim £ (-1 ) fc (// 1 /? 0 (£)) fc . 

Proof. Since || // 1 i? 0 (£)|| < 1, the uniform limit in (5.55) exists 
(see Exercise 5.7). For any feSf H we have 

[Ro(0 t (-i)W*«©)*| (H - Of 

= R, (Q(H 0 - Of - R 0 (0 HMXHo - Of + ••• 

+ (-irR 0 (OH 1 -R 0 (0(H 0 -Of 
+ R 0 (0 HJ - R 0 (0 H,R 0 (0 HJ+ - 

+ (—l) nR o(0 •" R o(0 H if 

= f+ (-1 W) H, - R 0 (0 HJ—r+f. 

This shows that if A(Q denotes the operator on the right-hand side of 
the equality (5.55), then we have — £)/ = / for all fe2 H . 

Since @J H = and 7? 0 (£) has the range , the operator 

(H — £) 7? 0 (£) is defined everywhere on Jf?. Consequently, we can 
write 

(H-ORoiO ti-muM))* 

fc =0 

= ( h 0 - o R^m - h,r 0 (o + - + (-im* 0 (o)«] 

+ //MO - HJUO H i R o(0 + - + (-i)«r 

= i + (-i)w)r ==> i- 

n-^+co 

Thus, we also have (H — Q A(Q = 1, i.e., A(Q is identical to 
(H - £)-i - *(£). Q.E.D. 

If A is an unbounded self-adjoint operator, then its spectrum S A is an 
unbounded set. However, the set S A C U 1 could be still bounded from 
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above or from below, in which case we shall say that A is bounded from 
above or from below , respectively. In the case where A is bounded from 
below, we denote by m A the greatest lower bound of the set S A : 

m A = inf A. 

\eS A 

We note in passing that A is bounded from below if and only if the set 

{\\ft 2 <f\Af>:fe® A ,f^0} 

is bounded from below, with the same greatest lower bound (see 
Exercise 5.8). 

Theorem 5*9 ♦ Suppose H and H 0 are two self-adjoint operators 
with identical domains of definition, that H 0 is bounded from below, 
and that for some 0^a<l,6^0, 


(5.56) || HJ || < a|| H 0 f\\ + *||/|U H x = H - H 0 , 

for all / e Hq = S> H . Then H is also bounded from below, and 

(5.57) m H m Hi> — max ’ h + a \ m » 0 I j • 

Proof . We shall prove the main statement of the theorem by showing 
that for any real 

(5.58) £ <»*//„ — m ax > b + *1 m H„ I j, 

we have || // 1 Z? 0 (£)|| < 1, and consequently, according to Lemma 5.2, 
such a £ e U 1 belongs to the resolvent set of H. This means that the 
spectrum of H contains only such points which do not satisfy (5.58); i.e., 
it is bounded from below and its greatest lower bound m H satisfies (5.57). 
To prove that || /!/ 1 2? 0 (^)|| < 1, we use (5.56) to derive the inequality 

(5.59) || tfi* o a)|| < *|| HM) || + b\\ * 0 (£)||. 

Since S^ 0 C \m R ^, +oo), and therefore for any /e and real £ < m Hb , 


n^o(o/ii 2 = r° y\-f\2 

J I A 4 1 



ll/ll 2 

I - £ I 2 ’ 

{equation continues) 
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i w ii 2 = J^t a-V <*» 


< SU P TT A "f |g f „ d W f = SUP 

XeS H o \ A ~ t \ J S h o 


A 2 I|/I | 2 


£+» I A - £ |« ’ 


we conclude that 


I *o(9ll < 


l 


m H„ — i 


| H 0 R 0 (Q || < sup 

AeS H <t 


A-{ 


< max Ji 




Combining the above two inequalities with (5.59), we obtain 

(5.60) || iWO II < max L + m h . • 

Since it is easily seen that the expression on the right-hand side of 
the inequality (5.60) is smaller than one when £ satisfies (5.58), it follows 
from Lemma 5.2 that such £ is in the resolvent set of H. Consequently, 
H is bounded from below by the expression on the right-hand side of 
(5.57). Q.E.D. 

We recall from §7 in Chapter IV that when H = T s + V s is the 
Schroedinger operator with a potential satisfying the conditions of 
Theorem 7.4 in Chapter IV, then Q) H = (Chapter IV, Theorem 7.5) 
if H 0 = T s is defined by (7.15) in Chapter IV and (5.56) is satisfied 
by any 0 < a < 1. Hence, we can state Theorem 5.10. 

Theorem 5*10* The n-body Schroedinger operator H = H 0 + F, 
W)“ (Pl v... Pn) = t -j£- ft Pi .-. P«). 

k =1 ^ 

(5.61) 

(F0)(r,..., r n ) = F(ri,..., r n ) 0(r x ,..., r n ), 

corresponding to a potential F(r x ,..., r n ) satisfying the conditions 
of Theorem 7.4 in Chapter IV, is bounded from below. 

As a by-product of the above theorem and (5.57), we can obtain 
estimates on the lower bound of an n-body Schroedinger operator H 
by computing the constants a and 6, as was done in the proof of Theorem 
7.4 in Chapter IV. 

A single-particle Schroedinger operator H with a potential satisfying 
the requirements of Theorem 5.8 also satisfies the conditions of Theorem 
7.4 in Chapter IV. Hence, such a Hamiltonian H is bounded from below. 
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The second resolvent equation 

(5.62) (H - A)-* = (H 0 - A)" 1 - (H 0 - A )~W{H - A)-* 

holds for any value of A in the resolvent set, and therefore, in particular, 
for any real A < m H . The desired result about the spectrum of H , 
stated in Theorem 5.8, can be then obtained from Lemma 5.3, which 
will be derived with the help of the next theorem, which implicitly 
introduces the following classification of the spectrum of a self-adjoint 
operator. 

Definition 5*2* A point A 0 in the spectrum S A of a self-adjoint 
operator A is a limit point of S A if it is either an eigenvalue of infinite 
multiplicity or an accumulation point of S A . The set Sf of all limit 
points of S A is called the essential spectrum of A , and its complement 
= S A — S A , i.e., the set of all isolated eigenvalues of finite multi¬ 
plicity, is called the discrete spectrum of A. 

^Theorem 5*11 (WeyVs criterion for limit points). A real number 
A 0 is a limit point of the spectrum of a self-adjoint operator A if and 
only if there is a sequence f x ,/ 2 ,... e Of A which converges weakly to 
the zero vector, and is such that \\f n || = 1 and 

(5.63) lim \\(A — Ao)/ n || = 0. 


Proof. Suppose that ||/ n || = 1, that w-lim n _> +O0 f n = 0, and that 

(5.63) is true. For an arbitrary open interval ( a 0 , b 0 ) containing A 0 we can 
write 

IM-A 0 )/nl| 2 = f (A — A 0 ) 2 d\\ Eff n || 2 
J u 1 

> (*o - A 0 ) 2 C d\\ Etf n II 2 + K - A 0 ) 2 f° d\\ Eff n II 2 

J b 0 J -co 

> 00 - A 0 ) 2 11(1 - K)fn II 2 + K - A 0 ) 2 II </ n II 2 . 

Now, the expression on the left-hand side of the above inequality 
approaches zero in the limit n —► +oo. Hence, we have 

lim || </J|=Hm ||(1-<)/J|= °, 


and consequently 


Hm || E A ((a 0 ,b 0 ])f n 


Hm ||«-<)/„ | 


lim||</„|r-Hm ||</„|| 2 =1. 
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This implies that the subspace E A ((a 0 , b Q ])Jf onto which E A ((a 0 ,6 0 ]) 
projects cannot be finite dimensional; in fact, if this subspace were of 
finite dimension N, we could select an orthonormal basis {e x e in 
it, and write 

II EA ((a 0 , b 0 ])f n || 2 = K>! |/„>| 2 + - + |<«* |/„>| 2 -> 0, 

since lim n _ +00 |/ n > = ••• = lim n _ +00 (e N |/ w > = 0 due to the weak 
convergence to zero of^ ,/ 2 Thus, we conclude that A 0 is either an 
accumulation point of S A or an eigenvalue with an infinite-dimensional 
characteristic subspace, i.e., that A 0 is a limit point of S A . 

Conversely, if A 0 is a limit point of the spectrum of A and D / 2 D 
are open intervals containing A 0 , then each E A (I n ) is the projector on an 
infinite-dimensional subspace of Hence, we can choose an infinite 
orthonormal system {e x , e 2 ,...} such that E A (I n ) e n = e n , n = 1, 2,.... 
Since we have 

I \(A - A,X || 2 = f (A - A„) 2 dH EUn II 2 < I /. I 2 II «. II 2 — 0 

when these intervals were so chosen that their lengths | I n | shrink to 
zero, and since lim n _* +00 < e n |/> = 0 for any/e Jf 7 because of Bessel’s 
inequality, we see that the three conditions of the theorem are satisfied 
by this sequence e x , e 2 ,... e Of A . Q.E.D. 

*Lemma 53^ If A and K are self-adjoint bounded operators on Jf 7 
and K is also completely continuous, then essential spectra of A and 
A + K coincide. 

Proof. If A 0 is a limit point of the spectrum of A , according to 
Theorem 5.11 we can find a sequence f x ,/ 2 ,... e such that ||/ n || = 1, 
w-lim m _ +00 f n = 0 and lim re ^ +03 1|(^4 — A 0 )f n || = 0. Since f x ,/ 2 
is a bounded sequence and K is completely continuous, it must contain 
a subsequence g x , g 2 ,... such that Kg x , Kg 2 ,... converges strongly to 
some vector h e However, since w-lim n _^ +00 g n = 0, we infer 
that || h || 2 = lim re _ +00 (h \ Kg n > = lim re ^ +00 (K*h | g n } = 0. Hence, 
linWoo \\( A + K ~ A 0 ) g n II = 0, which in conjunction with || g n || = 1 
and w-lim n ^ +00 ^ n = 0 implies, by virtue of Theorem 5.11, that A 0 is a 
limit point of the spectrum of A 1 = A + K. 

Conversely, if A x is a limit point of the spectrum of A x , then, according 
to the same argument applied to (— K), we deduce that A x is also a limit 
point of the spectrum of A = A x — K. Q.E.D. 

We can return now to the proof of Theorem 5.8, and observe first 
that since the spectrum of H 0 is [0, + oo), the spectrum of ( H 0 — A)” 1 , 
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A < 0, is [0, — 1/A], as seen from the relation 

E Ri > m (B) = E h °(F~\B)), F( A') = (A' - A)- 1 . 

The operator (A — i/ 0 ) _1 ^ is an integral operator with the kernel 
^(r, r'; A) = G 0 (r, r'; A)F(r'). To prove that the operator K( A) is 
completely continuous, we set V n (r) = V(r) for r ^ n and V n (r) = 0 
for r > n, and note that due to the local square integrability of V(r) 
the operator K n ( A) with kernel G 0 (r, r'; A)F n (r') is Hilbert-Schmidt. 
Since obviously 

|| K(X) - K n ( A)|| < ||(A - H 0 )-' || sup | F(r)| -> 0, n -> oo, 

r^>n 

we deduce from Theorem 8.3 in Chapter IV that i£(A) is indeed compact. 
For A < m H , the operator (H — A) -1 is bounded, and consequently, 
by Theorem 8.1(b) of Chapter IV, (H 0 — A) _1 F(i/ — A) -1 is also a 
completely continuous operator. Hence, according to (5.62) and Lemma 
5.3, the limit points of the spectrum of (H — A) -1 are the same as the 
limit points of the spectrum of (H 0 — A) -1 , i.e., they constitute the 
interval [0, —1/A]. This implies that the limit points of the spectrum 
of H constitute the interval [0, + oo), i.e., H has no negative limit points. 
This means that the continuous spectrum of H is contained in [0, + oo), 
that any negative eigenvalues of H must be of finite multiplicity, and 
that these eigenvalues have no accumulation point on the negative real 
axis. Thus, Theorem 5.8 is established. 

Theorem 5.8 deals with the point spectrum of the Schroedinger 
operator on the negative real axis. For the more restricted (but still 
very large) class of potentials V(r) for which there is an R 0 > 0 such 
that for all r ^ R 0 the potential V(r) is continuous and | r V(r)\ < const, 
it can be shown (see Kato [1959, Theorem 1]) that the Schroedinger 
operator has no positive point spectrum, i.e., no positive eigenvalues.* 
The reader should now recall that these results on the point spectrum 
of the Schroedinger operator are well illustrated by the case of the 
hydrogen atom treated in §7 of Chapter II. In that case we were dealing 
with a point spectrum of the form = {— a/n 2 : n = 1,2,...}, where 
a is a constant characteristic of the hydrogen atom. 

*5.8. Appendix: Relations between Resolvents 
and Spectral Functions 

We know already that if the spectral function E x of any self-adjoint 
operator H is given, then its resolvent R(Q = (H — £) -1 can be 

# See Iorio [1978] for recent results. General surveys can be found in Jorgens and 
Weidmann [1973], Heed and Simon [1978]. 
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computed by means of the formula 

(5.64) </1 R(Qg> = y^y- d<J | E A g>. 

It is not so obvious, however, that the converse of this statement is also 
true—namely, that the spectral function E x of H can be computed from 
the resolvent /?(£) (i.e., from the Green function of H , if it exists). 
To show that E x can always be computed from R(Q, we need the 
following lemma. 

Lemma 5*4* If the function F(X) has a continuous derivative F'(A) 
on [a, b\ , and is Riemann-Stieltjes integrable on [a, b\ with respect 
to ct(A), then 

(5.65) f & F(A) d x cr(A) = F(b) a(b) - F(a) a(a) - f V(A) a( A) d\, 

•'a "a 

where the integral on the right-hand side of the above relation is a 
Riemann integral. 

Proof. Taking A k ' = A k in the defining formula (3.18) and recalling 
that A o = a and A n — b, we get 

fV(A) d A a( A) = lim | £ F(A k ) a(A k ) - £ F(A k ) o(A M )| . 

Ja fco **- 1 ’ 

According to the mean value theorem of differential calculus, 

^(A ic) — ^(Afc-i) + ^ (Afc-i)(A fc — A^), A fc _ x ^ A k _ 1 A k . 

Consequently, we have 

(5.66) £ F(A k ) *(A k ) - £ F(A k ) o(A M ) 

fc=i fc=i 

= m °Q) - P(K) <*«) - t nh-i) a (K-i)(K - K-i)- 

fc—2 

Since-F'(A) is uniformly continuous on [a, 6], 

Hm f F'(X k _ i) - A w ) = JV(A) <*A) dX, 

5^+0 fc==2 a 

and the expression in (5.66) converges to the right-hand side of (5;65), 

Q.E.D. 
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Let us apply the result of the above lemma to the integral in (5.64). 
For Im £ ^ 0 this integral can be considered to be a Stieltjes integral, 
and since (A — £) _1 is continuously differentiable, we get 




<f\E b g) 

b-C 


<f\E ag y 

a-C 


f 6 <f\E,g) 

K (A-0 2 


dX. 


Letting a —> — oo and b —*■ + oo, we arrive at the result 


< 5 - 67 > •Cxh-*' 1 E “>-SZWw dK 

from which we can derive Theorem 5.12. 


Theorem 5*12* For any fixed fgsJ^, the function (f\R(Qg} 
is analytic at any complex £ with Im £ ^ 0, and for A x ^ A 2 

( 5 . 68 ) </1 (E, z + E^)g> - </1 (E h + E h _ 0 )g> 

1 ( -A 2 +«'e , -A 2 —ie \ 

= -y lim | </1 mg> dl-\ </ | R(t)g> dl\, 

TTl e-»0 ( J Ai+ie J Ai _ u ) 


where the above integrations should be carried out along any piecewise 
smooth curves which do not intersect the real axis. 

Proof. According to (5.67) the complex derivative 
d 'f \ R(t\r ' f + “ 2< f 1 Ea8> d\ 

H < f 1 R{0g> - J _ (A - 0 3 

exists at any point £eC 1 with Im £ ^ 0 since the above improper 
Riemann integral converges absolutely and uniformly in any closed 
neighborhood of £ which is disjoint from the real axis. Hence, < /1 i?(£) g > 
is analytic in the upper and in the lower complex plane. 

Due to the analyticity of </|i?(£)^) the integrals in (5.68) are 
independent of path. If we carry out the integration along straight lines 
and use (5.64) and (5.67), we get 

1 > = ir\ </i *(£)*> # - f </1 R (0g> di\ 

m J Ai+ic J Ax-w > 

= f {</1 R ( s + *«)£> - </1 R ( s ~ ie )g» ds 

m " Aj 


(equation continues) 
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The integrand in the last of the above integrals is Lebesgue integrable 
in s and A on the set [A x , A 2 ] x R 1 , since it is majorized by a function 
integrable on that set: 

<f\E*g> < WfWWgW 

(s - A ± ie? ^ (s - A) 2 + € 2 * 

Hence, the order of integration in s and A can be reversed and, after 
carrying out the integration in s , we obtain 


(5.69) 


i *+oo | 

: J-oo i(A^~ 


77 }(A^A 2 ) 2 + € 2 (A — Ajl) 2 + € 2 


</1 E xg y dX. 


Upon splitting the interval of integration into (— oo, X t ) and [A t -, 
4-oo), i = 1, 2, whereupon Lemma 4.2 can be applied separately to the 
first and second term in the above braces, we arrive at the relation 

ikn/ c - </1 E^gy + </1 E Xl _ og y = </ 1 E^gy - </ 1 E Xig y 9 

which is identical to (5.68). Q.E.D. 

That the above theorem provides an important means of computing 
the spectral measure of a self-adjoint operator from its Green function. 


Exercises 

5*L Prove that the set D Gq defined in (5.16) is a Borel set of Lebesgue 
measure zero in R 6 . 

5*2* Assuming that J K 2 » | K(oc 9 a')| 2 dfJi(oc) d[i( a) < +oo, show that 
the adjoint K* of the operator ( Kf)(oc ) = J K » K(oc 9 a')/(a') d^{oc) on 
L 2 (R W , fi) is also an integral operator with the kernel (i£*)(a, a') = 
K*(oc ' 9 a), and that the kernel (jK’*jK’)(a, a') of K*K is equal to 
J K a K*(oc " 9 oc) K(oc\ oc') dfJi(oc"). 

53. Prove that any integral operator K on L 2 (R n , n) 9 which has an 
L 2 kernel K(oc 9 oc') 9 is a Hilbert-Schmidt operator, and that 

Tr (K*K) = f | K(oc 9 oc') I 2 dfjL(oc) dn(oc'). 

J R 2n 

5*4* Give the reason why the limit in the mean if(a, oc') of a sequence 
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of Hilbert-Schmidt kernels K n { a, a'), 

lim f | K(a, a') - K n (a, *’)\* d^a) d^a') = 0 

is a kernel of Hilbert-Schmidt type. 

5*5* Show that if A is Hilbert-Schmidt operator in a separable 
Hilbert space and B is a bounded operator then BA is a Hilbert-Schmidt 
operator. 

5*6* Let A x , A 2 ,... be a sequence of bounded operators for which 
£fc=i II || < +oo. Prove that the uniform limit 


n 



exists and is a bounded operator with 

Mil < f IM*II. 

k =1 

5.7. Use the results of Exercise 5.6 to show that 


u-lim f c k A k 

*7->4-cn " 


exists if || A || < 1 and 1^1 ^ M for all k = 1, 2,.... 

5* 8* Prove that the self-adjoint operator A is bounded from below 
if and only if < /1 Af) > m A ||/|| 2 for all/6 S> A . 


6. Distorted Plane Waves in Potential Scattering 

6.1. Potentials of Rollnik Class 

In §5.5 we have learned that for square integrable potentials the 
equation (5.24) could be solved by the Fredholm method, and on first 
sight it might appear that exactly the same method is applicable to the 
full advanced and retarded Green functions G (±) (r, r'; A), A e , that 
are required in the solution-type Lippmann-Schwinger equations (4.74). 
Indeed, as discussed in §5.8, under reasonable conditions on V(r) 
(which from now on we shall take for granted) we have 

(6.1) Sf C = [0, oo), Sp n [0, <x>) = 0 , 

and formally taking in (5.24) and (5.25) the limit Im £ —*■ ±0 at fixed 
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A = Re £ e S C H we obtain 

(6.2) G w (r, r'; A) = G^(r, r'; A) + f K (±> (r, r'; A) G (±) (r", r'; A) dr", 

(6.3) iC (±) (r, r'; A) = G^(r, r'; A) F(r'). 


Unfortunately, the kernels in (6.3) are not Hilbert-Schmidt since 


(6.4) 


| «<*»(r, r'; A)| 2 


m 2 | F(r')| 2 
4 t 7 2 I r - r' I 2 


obviously is not integrable in r over R 3 . 

The Lippmann-Schwinger equation (4.67), which shares the same 
kernel with (6.2), 

(6.5) <P^(r) = & k (r) + f ^ (±) (r, r'; k 2 /2m) «£V) *' 

*'[® 3 


runs into exactly the same problem. 

The most frequently employed* way out of this impasse consists 
of factorizing V(r) as | V(r) | 1 / 2 F 1 / 2 (r), where 


( 6 . 6 ) 


V i /2M = I Hr) l 1/2 sgn V(r) if V(r) ^ 0, 

' ' {7} exp(—r 2 ), 7J > 0, if V(r) = 0, 


and then multiplying (6.2) or (6.5) from the left by F x / 2 (r). For example, 
in case of (6.5), upon setting 


(6.7) <P k (r) = r)0 k (r), $?\r) = V^(r) &^\r), 

the resulting equations can be written in the form 

(6.8) $t\r) = £ k (r) + f £ (±) (r, r'; k 2 /2m) <?£>') dr' 

J [® 3 


where the new kernels 


(6.9) £ (±) (r, r'; A) = V^Hr) G ( t\r, r'; A) | F(r')! 1 / 2 

are Hilbert-Schmidt for the class of potentials specified in the following 
definition. 

* Apparently, this factorization method was first discovered by Rollnik [1956], and 
then independently rediscovered by many others (Schwartz [1960], Grossmann and 
Wu [1962], Scadron et al. [1964], etc.). For the sake of simplicity, in (6.11) and (6.12) 
we shall assume that V(r) ^ 0 almost everywhere in [R 8 . 
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Definition 6*1 ♦ A real function V(r) is of Rollnik class if it is Borel 
measurable on U s and if its Rollnik norm || F|| R exists, where 

(6-10) || V ||| = f | F( ^ ^ )I ' dr dr'. 

J 0S« I r — r I 

We easily see that if V(r) in (6.9) is of Rollnik class then X^ (±) (A) are 
Hilbert-Schmidt: 

(6.11) <K<±>( A) | K<±>( A)> 2 = f | K {±) (r> r'; A)| 2 dr dr f = {m 2 \ 4tt 2 ) || V |||. 

V 

A frequently encountered type of potential V(r) of Rollnik class is one 
that is continuous, except possibly for a divergent behavior V{r) ~ r -a , 
0 < a < 1, as r-> +0, and bounded by r _2_e <>, e 0 > 0, as r—► oo 
(see Exercise 6.6). 

6.2. Fredholm Series Expressions for Distorted Plane Waves 

Another great advantage of (6.8) over (6.5) is that the inhomogeneous 
term <?> k (r) in (6.8) is square integrable if V(r) is integrable on U 3 : 

(6.12) || $ k || 2 = f | <?k(r)| 2 dr = (2n)^ f | F(r)| dr = (2tt)-* || V || x . 

*'0S 3 

Hence the Fredholm theory becomes immediately applicable, and we 
can state the following theorem. 

Theorem 6*1 ♦ Suppose that V(r) is of Rollnik class and that* for 
some e 0 > 0 

(6.13) V(r) = 0(r~ 3 ~ € °)> r -> oo. 

Then the modified Fredholm determinants and minors 

r'; A) of J? (±) (r, r'; A) in (6.9) [as given by (5.30) and (5.31), 
respectively] exist, and the infinite series 

(6.14) d {± X\) = i Oo 

n=0 

converge and are different from zero for all A e (0, oo), with the possible 


# Recall that /( r) = 0(g(r)) when | r | — »-oo if there are constants C and R such that 
I /(r) < C | #(r)| for all | r | > R. 
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exception of a compact set Sfy 1 of Lebesgue measure zero for which 
[0, -j-oo) — tf'y 1 C S^. Furthermore, for all k outside the (compact) 
exceptional set 

(6.15) ST y * = (k : (k 2 /2 tn) e 

of Lebesgue measure zero in [R 3 , the Lippmann-Schwinger equations 
(6.5) have the unique solution 

(6.16) 0<»_ W+ - 3 ^£L. 

X f ' &*> (r, r'; V'!\r') <2> k (r') dr'. 


(6.17) 


75 (±) (r,r';A) = £ r,r';A). 


The fact that Sfy 1 (and therefore also £P V S ) is compact and of measure 
zero is established in Appendix 6.7, whereas the rest of the statements 
are an immediate consequence of Theorems 5.3 and 5.4 since V(r) is 
integrable on 1 R 3 so that by (6.12) || $ k || < go. Indeed, due to (6.13) 
V(r) is integrable outside some sphere of radius R around the origin, 
whereas inside that sphere its integrability follows by Theorems 3.9 
and 3.13 in Chapter II from the existence of || V || R . 

Theorem 6.1 establishes that the Lippmann-Schwinger equations 

(4.67) possess for almost all keR 3 unique solutions, but it does not 
establish that these solutions are the distorted plane waves for which 
(4.25) is true. Indeed, (4.25) has been assumed in the course of deriving 

(4.67) , and now we have to prove that this assumption was justified. 
The following lemma provides the first of many steps in that direction. 


Lemma 6*1 ♦ If V(r) = 0(r -3-e °) is locally square integrable and 
of Rollnik class then 


(6.18) 


fatto = I f ^*(r) <A(r) dr 

J os 3 


exist for any k e U z — provided ®k t) (r) are the solutions (6.16) 
of (6.5), and if B C (0, +oo) is a compact set disjoint from then 


(6.19) II E H (B)<p || 2 = f |ft(k)| 2 </k. 

J k 2 /2 meB 
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Proof. To prove the lemma, we first have to make a few remarks 
which are also of independent interest. 

Comparing (5.5) with (5.10) we see that for fixed r e IR 3 , 

(6.20) G„(r, r"; 0 - U.m. ^ { «-«" *. 

Consequently, by Theorem 4.6 in Chapter III, for any given r e R 3 , 

< 6 - 2i > { _ wW - <2 ’ r "‘ l Gi(r ’ ° 

where l.i.m. could be dropped, since by (5.6) G 0 (r, r'; £) is obviously 
integrable in r' over R 3 at fixed r e R 3 . Since by (5.52) and Theorem 5.7, 
G(r, •; £) eL 2 (lR 3 ) for almost all r e R 3 , by analogy with (6.21) we can 
define 

< 6 - 22 > ; _ ^ Um -/„<*■ r,i {) *' 
for C e C 1 - S". Hence by (5.24) and (5.25), 

(6.23) 0 k (r; Q = 4> k (r) + f K( r, r"; £) <P k (r"; Q dr", 

•V 


and upon introducing by analogy with (6.7) 

(6.24) $ k (r;Q = V 1 l*(r)0 k (r;Q, 
we see that (6.23) is equivalent to 

(6.25) £ k (r; £) = <P k (r) + f £(r, r"; £) # k (r'; Q dr", 

* / os 3 

(6.26) £(r, r'; 0 = T 1 /^) G 0 (r, r'; £) | V(r')\'!\ 

Since K{ r, r'; £) is obviously Hilbert-Schmidt, we get in complete 
analogy with (6.16): 

(6.27) 0 k (r; Q = 0 k (r) + f D{r,r'-,Q$ k {r')dr’, 

d\f) rrg3 


B(r, r'; £) = £ At(r, r'; 0- 

n-0 


(6.28) 
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If we now define by analogy with (6.18) 

(6.29) $(k; Q = l.i.m. f <? k *(r; £*) dr, 

•'os 3 

we see from (6.22), (5.51), and (5.52) that 

(6.30) (G(0m k) = a - (k 2 /2m)-^(k; Q. 


Consequently, we obtain from the first resolvent equation (3.62) 


(6.31) 


<0 | [GSCO - G<£*)]0> = ({* 

= 2 i Im £ J 


0<G(Q 1 1 G(W) 
l(k; {) 

oj, i £ — (k 2 /2m) 


2 

dk. 


Thus the connection of ^ ± (k) in (6.18) with the spectral function of H 
finally becomes evident as we consider that by Theorem 5.12 we have 


(6.32) 


<>i> \ (E b » - E a »W) = - lim T dX f 

77 e-^+0 J a 3 


€ | *ft(k; A + ie) I 2 
(A - (k 2 /2m)) 2 + e 2 


dk 


when we take into account that E X H = E x _ 0 for A ^ Sfy 1 since no point 
in [0, oo) — Sfy 1 belongs to Sp^. Indeed, if Hiff 0 = X 0 ift 0 for some 
A 0 > 0 then if/ 0 e @ H C S) v , so that F(r) | ^ 0 (r) | 2 is integrable, and 
therefore its absolute value is integrable (see Chapter II, Definition 3.5); 
consequently, F 1 / 2 ^ eL 2 (U z ) and 

£ (+) (AoXFVV) = F 1 / 2 (A 0 - H 0 )~W* = V^(X 0 - H 0 )-'( A 0 - H 0 W = 

which implies that (1 — j^ (+) (A 0 )) _1 does not exist, and therefore by 
Theorem 5.4, d(X 0 ) = 0, i.e., A 0 e SPy 1 . 

On the formal level, (6.19) with B = [#, b\ C (0, oo) follows from 

(6.32) and Lemma 4.2 by noting that 

(6.33) ®£ +) (r) = <Pi- k >*(r) = Jim ® k (r; (k 2 /2m) + «). 
and consequently for k $ 

(6.34) ^±(k) = lim^(k; (k 2 /2m) ± ie). 

The rigorous proof is, however, considerably lengthier since one 
has to justify the interchange of orders of integration in A and k in 
(6.32) and the application of Lemma 4.2 under the double integral 
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sign with an integrand | $(k; A + ie) | 2 that is e-dependent. As we shall 
see, this can be achieved by proving that | ^(k; £) | 2 is uniformly bounded 
in k and £ as well as continuous in £ in the complex half-planes Im 
J >0 or Im £ < 0, respectively, wherever J(£) ^ 0, so that Fubini’s 
theorem and Lebesgue’s dominated convergence theorems can be 
applied in a routine manner. 

Expanding the determinant in (5.31) along the first row or first 
column we get, respectively, 

(6.35) D n { a, a) = d n K(oc , a) + f Z> n -i( a > a^) Kfa , a) 

= d n K(oc , a') + f K(oc , a x ) , a') d m QL x . 

Combining these two relations we obtain 

(6.36) I D n (oL , a')| — I d n K(oc , a') + d n _ x f K(oL y K (a^ , a') 

I 

+ f l£(a, ot x ) i£(a: 2 , a') Z> n _ 2 (, a: 2 ) </ m a 2 

* B5 m 1 

<1411 ^( a > a ')l 

+ (I 4-1 I + II D n-2 ll 2 ) II ^( a > 011 II ^(■,« / )l|. 


In the case of (6.26)-(6.28) this inequality yields 
(6.37) | B,( r, 01 < C. | ^,)| f 


where due to (5.47) and (5.48) we can choose the positive constants 
C n to be independent of r, r', and £, and such that C = £ n=0 C n < 00 . 
Consequently, by (6.27), 

(6.38) | 0< +) (r; £)| < (2»)-»/«[l + F„(r; £)], £ e « | £ e C 1 , <f(£) ^ 0}, 


(6.39) *V(r; £) 


g ff lV(r')\dr' (r | F(rJ| rfr, W 2 

i(Q| l-V I r — r' I \J„. | r - rj I 2 / 
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Due to the conditions imposed on the potential, F v ( r; £) is locally 
square integrable and bounded at infinity (see Exercise 6.5). Since 
(5.47) yields the uniform convergence and therefore, due to the con¬ 
tinuity of J(£) for Im J >0 or Im £ < 0, also the continuity of <?(£), 
this boundedness is uniform when £ varies over any compact set not 
containing zeros of *?(£). Hence, the procedure described earlier leading 
to (6.19) for B = [a, b\ can be applied if iff( r) e ^^(IR 3 ). The outcome 
can be then extended to arbitrary i/j eL 2 (R 3 ) by taking the limits in the 
mean appearing in (6.18). 

The general case of compact B C (0, oo) disjoint from «$V can be 
reduced to the above special case by using Theorems 1.3 and 1.7 in 
Chapter II. Q.E.D. 

6.3. Asymptotic Completeness and the Generalized Parseval’s 
Equality 

We recall from Theorem 5.8 that locally square integrable potentials 
which satisfy (6.13) give rise to a Hamiltonian H whose continuous 
spectrum contains no negative values, so that by Theorem 6.1 

(6.40) [0, oo) - .0V C S c " C [0, oo). 

Let us introduce for each eigenvalue A e Sj/* an orthonormal basis 
{0 A>V } in the proper subspace of that eigenvalue (which by Theorem 5.8 
is of finite multiplicity if A < 0). Then for any /, g eL 2 (IR 3 ), 

(6.41) </|£"(S p "k> = £ £/*(A,y)i(A,y), 

AeS p * v 

(6.42) / *(A, y) = </1 0 A . V >, £(A, y) = <<& A , V I g>. 

Lemma 6.1 suggests that for the continuous spectrum we might expect 
a similar relation, so that the following generalized Parseval's equality 
[cf. (4.15) in Chapter I] might hold: 

(6.43) <f\g> = <f\ £"(S P H )g> + </ 1 £*(S 0 *)*> 

= £ £/ *(A> y) i(A, y) + I* /±*(k) £ ± (k) dk. 

AeS p " v V 

The nontrivial part of equation (6.43) is the last expression, which 
follows from the next theorem. 

Theorem 6*2* Suppose that V(r) is locally square integrable, of 
Rollnik class, and O(r~ 3 ~ eo ) at infinity. Then the continuous spectrum 
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of H satisfies (6.40), and for any Borel set B C S C H , 

(6.44) (E H (B)<Jj)(r) = l.i.m. f $ ± (k) dk, 

Jk z /2 meB 

(6.45) £t(k) = (0 k w | r? = Li.m. f <P^*(r) ftr) dr, 

• , 0S 3 

for all i/jeL 2 ( R 3 ). Furthermore, for any /, £eL 2 (R 3 ), 

(6.46) </1 E"(B)g> = f /±*(k) | ± (k) dk. 

J k z /2 meB 

Proof. When B is compact and disjoint from S? v \ (6.46) is obtained 
from (6.19) by using formula (5.13) in Chapter III. 

To arrive at (6.44) for such B we choose /(r) e ^^(R 3 ), so that by 
(6.38) 

(6.47) (Inf* I 0^(r)/(r)| < [1 + F r (r; k 2 /2 »)] |/(r)| 

is integrable in (r, k) over R 3 X Aq\B). Hence, for arbitrary g eL 2 (R 3 ), 
the reversal in order of integrations in 

(6.48) </1 E H (B)g) = f dkgJJs) f drd?\r)f*it) 

j a^Hb) V 

= f drf*(r) f dWPWiitto 

J os 3 J ^o Mb) 

is justified by Fubini’s theorem, and we conclude that 

(6.49) (E H (B)g)(r) = f 0< ±> (r)g ;t (k)dk. 

j Aq x (b) 

If we interpret “l.i.m.” in (6.44) for any Borel set B C S C H as a proce¬ 
dure in which the integration is first performed over sets 

(6.50) A~\B n B n ) = {k : A 0 ( k) = (k 2 /2 m) eBn B w }, 

with B n compact and disjoint from and then the limit in the mean 
is taken for B x C B 2 C ••• —► [0, oo) — SPy 1 , we finally arrive at the 
general result (6.46). Q.E.D. 

In view of (6.45), the formula (6.46) can be written in the symbolic 
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notation introduced in (4.82) as follows, 

(6.51) E»(B)=\ \ 0f) dk(0t k> |, 

and therefore (6.41)-(6.43) implies that 

(6.52) £ Z I ®a.v><<Pa.v I + f J rt k ) Jkfet* 1 = 1- 

AGS p ^ v J (k 2 /2m)eS c H 

Thus, Equation (6.52) is intimately related to the following notion of 
asymptotic completeness. 

Definition 6*2* A quantum mechanical theory on with given 
free and total Hamiltonians H 0 and H is said to be asymptotically 
complete if any interacting state W(t) orthogonal to the all bound states 
of the theory is a scattering state, i.e., it has incoming or outgoing 
(physical) asymptotic states Y^t), ex = in, out. The wave operators 
Q ± of a theory are said to be complete if and only if the theory is asymp¬ 
totically complete. 

In §1.4 we denoted by the closed subspace spanned by all bound 
states in the Hilbert space Jf (2) of a two-particle system. In §1.7, 
upon eliminating from the problem the center of mass motion, we were 
left with the Hilbert space Jf (1) of the reduced particle whose bound 
states spanned the closed subspace In the present context of 

wave mechanics for spinless particles we have = L 2 (1R 3 ), so that 

(6.53) jrP = E h (S/)L 2 (U 3 ) = L 2 (R 3 ) ©L| c (R 3 ). 

Hence the theory is asymptotically complete (and Q ± are complete) 
if and only if* 

(6.54) R± = Lf, c ([R 3 ) = E H ( S/) L 2 ([R 3 ). 

On first sight it might appear that by establishing (6.43) or, equival¬ 
ently, (6.52), we have already established the completeness of Q ± for 


* Motivated by primarily mathematical considerations, many authors follow the 
custom of calling Q± complete if and only if the projector onto R± coincides with E H ( S^ c ) 
rather than E H (S C H ), where S^ c is the absolutely continuous spectrum of H. As a rule 
they then proceed to prove (see, e.g., Amrein et al. [1977]) that the singularly continuous 
spectrum is empty, so that in practical terms their definition coincides 

with Definition 6.2, which is dictated by physical rather than mathematical circumstances 
(see, e.g., Schechter [1978]). 
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the class of potentials specified in Theorem 6.2. The trouble is, however, 
that ^{^(r) in (6.45) and (6.52) appear only as solutions of the Lippmann- 
Schwinger equations (6.5), rather than as eigenfunction expansions 
that are related to Q ± via (4.25). Having already proven in §4.6 that 
(4.25) implies (6.5), we still have to complete the circle by proving 
that (6.5) in turn implies (4.25) under the conditions of Theorem 6.2 
(and thus, incidently, establish asymptotic completeness in two-body 
potential scattering). This final task will be carried out in the context 
of proving Theorem 7.2 in §7 by first establishing in Lemma 7.2 the 
existence of operators W ± for which (4.25) holds with iff ± = W ± p if 
we adopt the solutions (6.16) of (6.5) in the role of ^^(r), an< i then, 
at long last, settling the issue in §7.3 by proving that W ± = Q ± . In 
the meantime, we shall derive some additional properties of ^t fc) ( r ) 
that are of great interest physically as well as mathematically. 

6.4. The Scattering Amplitude 

There are two distinct methods of defining the scattering amplitude 
in potential scattering: by means of the T matrix [i.e., by adopting 
formula (6.67), which we shall derive later, as a definition], or in terms 
of the asymptotic behavior of ^^(r) as | r | -> oo. We shall adopt the 
second alternative since it is in that context that the scattering amplitude 
makes its first appearance in most textbooks on quantum mechanics. 

Definition 63* The function / k (0, cf>) is called the scattering am¬ 
plitude of the incoming distorted plane wave ^[ +) (r) in the direction 
(0, (j>) if for a given k e M 3 — ^ v 3 and (0, (j>) e £? s there is an e > 0 such 
that 

(6.55) *> k (r) = (27r) 3/2 [0< +) (r) - $ k (r)] = [/ k (0, <f>)/r] + 0{r~^) 

as r —> oo, where (r, 0, <£) are the spherical coordinates of re IR 3 . 

Theorem 63* If V(r) — O(r~ 3 ~ eo ), r-> oo, is square integrable 
and of Rollnik class then ^ k (r) is locally square integrable at each fixed 
k e R 3 — Furthermore, for each compact set B disjoint from Sf v 3 
and for a certain R 0 > 0 there are constants C^B, R) and C 2 (B , R) 
such that for all k e B, 


(6.56) 


f | v k (r)\ 2 dr < C X (R, R 0 ), 
J r<R 0 


Vk(r) ^M^L e i> 


< 


C t (B, R 0 ) 

yl+tf 


r > R 0 , 


(6.57) 


r 
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where e = e 0 (l + e 0 ) _1 and 

(6.58) f k (6, ft = - m(2„) 1/2 exp [-ik (r' • y)] F(r') 0< +) (r') dr'. 

Proof. The conditions imposed on V{r) in the present theorem are 
the same ones as in Lemma 6.1. Consequently, ^ k (r) satisfies (6.56) on 
account of (6.38). 

Let us introduce the function 


(6.59) sw k (r) = - (e ikr + w k (r)) F(r) = -m(2w)^V(r) 0< +) (r) 
in terms of which by (6.3), (6.5), and (5.17), 

(mo 

J k 3 I r — r x | 

We note that t? k (r) is certainly uniformly bounded at infinity for k e B 
(see Exercise 6.7), so that there is an R B > 0 independent of k e B such 
that 


(6.61) | w k (r)| < const | F(r)|, | r | ^ R s . 

Let us write now (6.60) in the form 

( 6 . 62 ) „ kW _j 

J I r — r x | 

r exp(t&| r — r x |) ... , , . r „ 

+ T- I «fc( r i) dr i = +4 ■ 

J r 


r x >R 


| r - r t 


According to (6.61) there is a constant M such that for suitably large R 
(6.63) | w k (r)| < M , r > R, k eB. 

Consequently, for such values of R 




r x >R 


dr x 

I r — r x I I r x | 3+e o 


By selecting, for fixed r, the z x coordinate axis in the direction of 
the vector r, and working in spherical coordinates, we arrive at the 
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following estimate of the above integral: 

(6.64) 

[- 1 +_ l _1. 

L € o rk e « (1 + € n V 1+e ° j 


47 tM 


0 rR€ 0 (1 + € o>" 1+eo 

Furthermore, using Taylor’s formula, we get for rjr +0, 
exp(/£| r — r x |) = exp {ikr [l - + O 

_l ri l = I[i + o(i)]. 

Consequently, we can write as rjr -> +0 
exp(/&| r — v x [) 


(6.65) 


r -r x \ 

_ exp (ikr) 


(£)][*+ <>(*)]• 


Let us take in (6.62) R = r 1_s , with some fixed S < 1, and then 
choose r so large that (6.64) becomes valid. Inserting (6.65) in I R \ we 
obtain, after noting that for the selected R we have R/r = r“ s , 




Now, by virtue of (6.63), we also have for all k e B 


L, “ p H* -E r 1 -) "‘ <ri) *■ I < 1 *> - 0 (wW) • 

and thus, uniformly in k e B, 


Ir = 


exp (ikr) 


L ex P (-*- L r L -) w k( r i)* 1 


^ ° ( r l +« ) ^ ° Ci+a-»)« 0 ) • 
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On the other hand, for the chosen value of R , (6.64) yields 

I r = ° ( r i+a~su) + 0 ( r i+ E() ) ’ 
so that (with asymptotic contants independent of ke B ), 

®k(r) = Ir + i "r= CXP ^ / 3 ex P (~ ik r fl ) w k(ri) dr x 

r \ r t 

+ ° (~h^) + ° ( r r+<L> 6 -) + o (-^7) • 


Setting in the above estimate 8 = e 0 /(l + e 0 ) < e 0 , we finally obtain 
(6.57). Q.E.D. 

The main significance of the above theorem lies in the fact that it 
provides us with the explicit formula (6.58) for the scattering amplitude 
/ k (0, </>). This formula requires, however, a full knowledge of <& k +) (r). 
On the other hand, even in the case where ^ k +) ( r ) not known, we can 
still obtain an approximation of / k (0, <£) by using (6.16) and neglecting 
the second term. Then we obtain the first Born approximation of the 
scattering amplitude: 

(6.66) f k (6, </>) exp (-ik * - r - ) F(r') exp(ikr') dr' 

= —m{2v) ll2 P{kr - k). 

Higher Born approximations can be obtained by an iteration method 
presented in the next section 

By denoting with io the unit vector with angular spherical coordinates 
(0, (j>) and considering / k (0, cf>) as a function / k (<*>) of io rather than (0, <£), 
we immediately note the intimate relationship between the scattering 
amplitude (6.58) and the T matrix (4.95): 


(6.67) A(«) = -(2 nfm I 0L(r') V(r') 0< +) (r') dr' 

V 

= | V | <P< +) ) 

= ~(27r) 2 m(kio | T | k>. 

Written in this form, / k (<*>) exhibits a remarkable symmetry (intimately 
related to the time invariance of the Schroedinger equation): 

/fcfe( <A) ) = k). 


( 6 . 68 ) 
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Indeed, using (6.5) to eliminate the plane wave <P k *(r) = <P_ k (r) 
from (6.67) we obtain 

(6.69) AM = -{2-nfm f U&V) V(r') *!£.(?) dr' 

+ 2im 2 f dr'0 ( ^(r') V(r') 

x m 

Under the conditions of Theorem 6.2 the integrand of the previous 
double integral is integrable in (r, r') on U e (see Exercise 6.2, and note 
that V(r) is integrable, whereas ^ k +) (r) is bounded in re IR 3 ) so that 
Fubini’s theorem can be applied. Hence the right-hand side of (6.69) 
stays unchanged if we replace k and to with —to and — k, respectively. 

The relations (3.50), (4.96), and (6.67) between S operator, transi¬ 
tion operator, T matrix, and scattering amplitude show that the action 
of the scattering operator on a wave packet 0(k) = ift(kto) can be ex¬ 
pressed directly in terms of the scattering amplitude: 

(6.70) (S0)~(k) = 0(k) + (ik/ln) f / to '(fc) ftkto') dm’. 

This represents yet another rigorous version of the formal relation 
(4.107). 

6.5. The Born Series 

The simplest method of solving an operator equation such as (5.29) 
is by repeated iteration, which then yields 

(6.71) 0 = (1 +a>K + u>*K 2 + ••• + a>*K n + ••■)/> 

provided that this infinite series converges in some sense. Clearly, the 
expression in between parentheses is nothing else than the series for 
(1 — c oK)- 1 (cf. Exercise 7.9 in Chapter IV), and we shall obtain con¬ 
vergence in operator bound if | co | || K || < 1 (see Exercise 5.7). Further¬ 
more, if K is Hilbert-Schmidt and | co | || K || 2 < 1, then this convergence 
in operator bound and is also convergent in the Hilbert-Schmidt norm 
(recall that ||i£|| <||i£|| 2 ). Since for operators with Hilbert-Schmidt 
kernels K(oc> a) such as those appearing in (5.28) we have 

(6.72) || K ||| = (K | K\ = Tr(K*K) = f | K(«, «')| 2 d™u d™*', 
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this convergence becomes (see Exercise 5.4) convergence in the mean 
on L 2 (U 2m ): 

(6.73) ( £ <» n K n ) (<*, «') = 1-i.m. £ a'), 

\n=0 1 N ~ > °° n=0 

(6.74) K n {pL, Oi') = f d m oi n _ 1 K{ci, «„_!) f d. m u nr _ i K.{ct 1tr _ 1 , <x n _ 2 ) 

"• f d m <x 1 K(a 2 , a,) K(a ± , a'). 

J 05 m 

These remarks can be applied in particular to (6.8), so that 

(6.75) ^(r) = A(r) + £ f (^ <±)K )(r, r'; k 2 /2 m) $ k (r’) dr’. 

n=l J 05 3 

Upon eliminating from both sides of (6.75) the common factor F 1 / 2 (r), 
we get a series solution of the Lippmann-Schwinger equations (6.5): 

(6.76) 0^(r) = * k (r) + f G^(r, r'; k 2 /2m) F(r') <Z> k (r') dr' 

* B5 3 

+ f k 2 /2m) F(n) 

* rm3 


X J , r'; k 2 /2m) F(r') <P k (r') + 

This is the Born series for distorted plane waves, which inserted in 
(6.58) provides the Born series for the scattering amplitude, whose 
first term we had encountered in (6.66). Working with the on-shell T 
matrix (4.95) instead of the scattering amplitude, we can write this 
Born series in the symbolic form 

(6.77) <k | T | k'> = £ <k | V(G< +) (A)Vr | k'>, A = k 2 /2m = k' 2 /2m, 

W=0 

in which the nth term equals 

(6.78) (<P k | V(Gl + \k 2 l2m)V) n \ <P k -) 

= f dr& k *(r) V(r) f dr'(G ( +\k 2 l2m)V)«(r, r') 0 k '(r'). 

•'os 3 ■V 
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Theorem 6.4. If F(r) = 0(r~ :i -‘o), r -+ oo, is locally square 
integrable and || F|| R < oo, then there is an energy value A K such that 
the Born series (6.77) for the on-shell T matrix converges uniformly 
in k and k' at all higher energies A = k 2 /2m > X v . Furthermore, if 
II Ur < 2n/m then the series (6.77) converges uniformly in k, k' 6 R 3 
at all energies, and the exceptional set Sf v x is empty. 


Proof. According to (6.78), (6.8), and (6.9), 

(6.79) <k | F(G< +) (A)F)" | k'> 

= (® k I V (Gq +> (A) F)" -1 Go +) (A) | F | 1/2 £ k -> 

= (0 k || F | 1/2 (F 1/2 G< +> (A) | F | 1/2 rdv> 

= <| F |i/*0 k | (F<+)(A))»0 k -> 

is a bona fide inner product in L 2 (R :l ). Since by (6.12), 

(6.80) K| F |V20 k | {KM(X)Y$ k .' } \ < ||(£<+>(A))« || || | F |i/»<P k || || || 

<(2t 7)- 3 ||^ ( +)(A)||« f | F(r)| dr, 
* , os 3 

we see that (6.77) converges if 

(6.81) || £<+>(A)||« = ||[(if<+>(A))*if<+>(A)]|| 2 

< ||(£<+>(A))*£<+>(A)|| 2 < 1. 

However, by specializing (6.72) we get from (6.9), 

(6.82) || £<+>(A))*if<+>(A)|| 2 

= Tr{[(£<+>(A))*i?<+>(A)] 2 } 

= f dr' dr" | F(r') F(r*)l 


X 


IJ G< +) *(r, r'; A) G< +) (r, r"; A) | V(r)\ dr\ . 
By (5.17) the term in braces equals 


( 6 -* 3 ) Jfrf ■ 
477 V 


m 2 (• exp|ffl(| r — r" | — | r — r' |)] 


r — r r — r 


F(r)| dr, k = (2mA) 1 / 2 . 
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In view of the Riemann-Lebesgue lemma* we conclude that upon 
introducing spherical coordinates, the radial integral in re [0, oo) 
approaches zero as k —> oo for any fixed k', k", and (0, cf>). Since for all 
k e [0, oo), the integrand in (6.83) is majorized by an integrable and 
^-independent function that is square integrable in k', k", and (0, cf>) 
(see Exercise 6.4), we deduce from Lebesgue’s dominated convergence 
theorem that (6.82) approaches zero as A —> oo, so that there is a X v for 
which (6.81) holds true whenever A > X v . Taking into account that 
by (6.79) and (6.80) for any such A, 

(6.84) £ |<k | F(G< +) (A)Ff | k')| < (2 tt)-« || V ||, £ || £<+>(A r )||«, 

n —0 n —0 

we see that the first part of the theorem is true. 

According to (6.11), (6.79), and (6.80), we also have 

(6.85) |<k | V{G^\\)Vf | k'>| < (2n)~* II V H, || £ <+, (A)||£ 

<(2n)-°\\VUm\\V\\ K l2nr, 

so that (6.77) converges uniformly and absolutely for all k and k' on 
the energy shell if || F|| R <27 r/m. The same inequality also implies 
that 

(6.86) (1 - ^(A))- 1 = u-lim £ X)) n 

■^^°° n =0 

exists for all Ae [0, +oo), and therefore by Theorem 5.5, d(X) ^ 0 at 
all energies, so that Sfy 1 is empty. Q.E.D. 

6.6. Distorted Plane Waves as Solutions 

OF THE SCHROEDINGER EQUATION 

For the complete identification of ®k t) (r) as distorted plane waves, 
we have to establish that ^k +) (r) is a solution of the Schroedinger 
equation with the eigennumber k 2 /2 m. Theorem 6.6 provides a set of 
precise criteria under which this is true. 

Theorem 6*5* Suppose that the potential V(r) is continuous in 
the neighborhood of a point r e R 3 , and that it satisfies the conditions 


* See Lemma 4.1 in Chapter III, and note that its conclusion remains valid if we change 
variables of integration by setting x = h(y), where h'(y) exists and is positive (or negative) 
almost everywhere. 
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imposed on it in Theorem 6.2. Then the distorted plane waves ^^(r) 
satisfy the time-independent Schroedinger differential equation 

<“ 7) (- 3T ■ 4 + M 

at that point r e R 3 , and for any kelR 3 - \ . 

Before proceeding with the proof of the above theorem, let us make 
a few remarks. 

For the proof of this theorem we need, essentially, only to know that 
V(r) and ^k t) (r) are continuous in some neighborhood of r, and that 

I n*)\ = O | ®^(r)| = 0(1). 

The restrictions on ^^(r) are fulfilled automatically if V(r) satisfies 
the conditions of Theorem 6.2 and the restrictions imposed on it in the 
present theorem. It can happen, however, that ^^(r) satisfies these 
restrictions even when V(r) does not obey all the conditions of Theorem 
6.2 (see §7.2). Hence, the validity of Theorem 6.5 is somewhat inde¬ 
pendent of the validity of Theorem 6.2. 

The proof of Theorem 6.5 is straightforward, and it essentially 
consists in applying the Laplacian A to the right-hand side of equation 
(6.5) and computing the result. This computation is easily carried out 
with the help of Lemma 6.2. 

Lemma 6*2* Suppose /(r) = 0(r ~ 2 ~ e °) for some e 0 > 0, and that 
there is a neighborhood Jf of the point r e R 3 in which /(r) is continuous 
and the function 


(6.88) g{ r) = f exp (^ r r ' D /(r >) dv ’ 

J R 3 I r — r I 

is well defined by the above integral. Then ^(r) is twice differentiable 
at that point in each of the variables x y y, and z, and 

(6.89) (—A^)(r) = ^(r) + 47r/(r). 

Proof. Let S be a neighborhood of r and write 


(A£)(r) = A f 

*> IB 3 -. 


o r-s 


exp(/^| r — r' |) 
| r - r' 


/(r') dr' + A 


exp(/^| r — r' |) 


r — r 


/(r') dr'. 


We shall compute separately the two terms on the right-hand side of 
the above identity. 
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Straightforward differentiation shows that 


(6.90) j_ „e*p(* l r-,'|) 
' ox r — r 


(* - *') exp(*fe| r - r' I) /, L 

I r — r' I 2 \ 



/fion 82 e xp(tfe| r — r' |) 

'’ dx* | r - r' | 

_ exp(t&| r — r' |) 

I r - r' | 2 

3 (* - *7 \ 

| r — r' I 8 / 


X 




1 + k*(x - x'f 3 iHx-x'f 
I r - r' | | r - r' | 2 


On any bounded domain which does not contain some closed neighbor¬ 
hood of r, we have 


|/(r')| < const -j^rrTrj 


Now, the function on the right-hand side of the above inequality is 
integrable on account of the existence of the integral in (6.88) and the 
fact that if a function is integrable then it is also absolutely integrable. 
Hence, we conclude that/(r) is locally integrable. 

It is easy to infer from (6.90) and (6.91) that for sufficiently large 
Ro > 0 and all r x from some open neighborhood SC of r 


d n exp(ifc| r - r'|) , 

dx n I r — r' I * ' ' 


I const 
| r' | 8+< o ’ 
const/(r'), 


r' ^ R 0 

r'eR 8 -5 
and | r' | < R 0 


for n — 1,2. Thus, we deduce that 


' R —<5 


d n exp(ik\ r — r' |) x/ , w , 


• - 1 , 2 , 


exist for r e S. Consequently, d/dx and d 2 jdx 2 can be carried under 
the integral sign when applied to the right-hand side of (6.88) (see 
Exercise 6.8). Since the same obviously holds for the variables y and 
and since by (6.91), 


(6.92) ( 



+ 


a 2 

8y* 


a 2 \ exp(t&| r — r' |) ,, exp(ife| r — r' |), 

-&*) | r - r' | | r — r' | 
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we obtain 
(6.93) 


^ u 3 -s 


exp(*&| r — r' |) 


f(r')dr'=k 2 f 

J u-s 


exp(*&| r — r' |) 


/( r ) dr'. 


I r — r | J | r - r 

Let us choose 

(6.94) 5 = {r x : | x x — x \ <a 9 \y 1 ~y\ <a 9 \z 1 —z\ < a). 

If a > 0 is sufficiently small, then the generalized mean-value theorem 
of the integral calculus can be applied to obtain 


(6.95) 


8x 2 


Is 


exp(*&| r — r' |) 
I — r' | 


Z(r') dr’ 


= lim 4- p dx’ r a dy’ r dz’ l f{r’) 

ax^o Ax J v _ n L_ n dx r — r 


J x—x x —Ax 

_ r y+a r z+a 3 exp (ik | r — r' |) 

"v-a, *z 


aJ 1 =—a 
x j=+a 


y—a ^z—a 


dx 


I r - r' | 


m 


x'=x+a 


dy dz' 


, . r v+a r z+a 8 exp (ik I r — r' I) , , , , \ x ' =x+a 

= -/(*.*>. *o)J I 8^—iZ -=71- -dy dz \ 

J y—a J z—a 


|r 


where | y 0 — y | < a and \ z 0 — z \ < a. Using (6.90) and the mean 
value theorem of integral calculus, we get 


y—CL —CL 


z+a 8 exip(ik | r — r' 


0* I r — r' | L'_-o+* 

exp[*% 2 + y 2 + s 2 ) 1 ' 2 ] 


dy' dz' 


(u 


ik 


1 


r) (2«) 3 , 


<2 2 + y 2 + # 2 \ (a 2 + y 2 + z 2 ) 112 , 

where | y \ < a and | z | < a. This shows that, due to the continuity 
of/(r) in by taking the limit a —> 0 and using (6.90) we shall get in 
(6.95) a contribution of the form 

(6 96) f(r) lim f + ° f + “ ~ 2a exp ^ 2 + & ~ ?')* + <*" "'W 1 - W 

( ) ; w v . -L (« 2 + ( 3 '- y ) 2 + (^- «') 2 ) 3/2 ^ ' 

To compute the above limit, we must compute the following integral 
explicitly: 

r +a r +a exp[V&(<2 2 + y x 2 + z-f) 112 '] 

J-a J-a (« 2 +J1 2 + *i 2 ) 3/2 


dy x dz x 


pir/Ok /»a/COS0 jjy 

8 exp[^(a 2 + r| v ) 1/2 ] J ^ d<f> 


2n 1 


^iexp[i% 8 fr» v ) 1/J ] 
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where r av is obtained by applying again the generalized mean-value 
theorem to the first of the above two integrals, and it satisfies the 
inequality | r av | < ay/2. Hence, the limit (6.96) is equal to —(47r/3)/(r). 

Due to the symmetry of the considered expressions under the 
exchange of x, y, and z, when S is the box in (6.94), we have 


(6.97) 


lim A 

a-* 0 


J S I r — r I 


, 0 * 
= 3 am ——- 

a-»0 CX L 


I. 


exp(i&| r — r' |) 
I r - r' | 


f(r')dr' =4nf(r). 


Hence, the desired result in (6.89) is obtained by combining (6.97) 
with the result (6.93) in which we let a —> +0. Q.E.D. 

In order to be able to apply the above lemma to derive (6.87) from 
(6.5), we have to establish that 

(6.98) /(r) = F(r) $><>) = O , * 0 > 0, 


and that the above function is continuous in some neighborhood of 
the considered point r e IR 3 . 

Since | ®k fc) (r)| = 0(1) for \ (see Exercise 6.7), it immediately 
follows that (6.98) is true. The continuity of ®t fc) ( r ) is an easy con¬ 
sequence of (6.5) (see Exercise 6.9). Hence, in computing the effect of 
applying the Laplacian A to the right-hand side of (6.5), we can use 
Lemma 6.2; then (6.87) easily follows. 


*6.7. Appendix: Analytic Operator-Valued Functions 

We have seen in §5.8 that </| R A (£)g/ is an analytic function on the 
resolvent set C 1 — S A of a self-adjoint operator for all /, g e Jf. This 
categorizes i?^(Q as an analytic operator-valued function. The general 
theory of such functions is based on the following definition. 

Definition 6*4* The vector-valued function/(Q e defined on an 
open domain D of the complex plane C 1 is said to be analytic at £ 0 e D if 

(6.99) m = w-lim{[/(? + JO - 

exists in some neighborhood of £ 0 . 

Lemma 63* If /(£) is analytic at £ 0 then 

(6.100) /'({ 0 ) = s-lim{[/(£ 0 + AQ — /(«]//!£}. 
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Proof. Let ^ be a circle centered at £ 0 that lies within an open 
neighborhood where (6.99) exists. Since obviously <£|/(£)) is for 
any fixed g e Jf 7 an analytic complex-valued function in that neigh¬ 
borhood, we can apply Cauchy’s integral formula from complex analysis 
and write 

(6.101) <* I {[/(Co + AQ -/(Co)]/^C} -/'(«> 

= (2-r 1 f M-KK - (Co + AQt 1 - (£ - So)- 1 ) 
-[(£-«-% I/(£)>« 

if £ 0 + J£ lies within the interior of the disk enclosed by V. Furthermore, 

I (8 1/(0) I is continuous and therefore bounded on the closed and 
bounded (and therefore compact) set V for each g e By Lemma 2.2 
in Chapter III (generalized from sequences to arbitrary sets) | </(£) | g) | 
is therefore bounded uniformly in g e ^ i.e., 

\<g\m>\ <C||*||, £e<T. 

Hence the absolute value of the right-hand side of (6.101) is majorized 
by 

(6.102) (C/2*) Ik II f l(£ - Co - ^C)- 1 - (C - Co)- 1 !! C - Co I- 1 d | c I, 


where the integral can be as small as desired by choosing | J£ | sufficiently 
small. Consequently, the same is true of the absolute value of the left- 
hand side of (6.101), since it is majorized by (6.102) for all g e Jf. 
Hence, (6.100) follows. Q.E.D. 

Definition 6*5* Let A(Q, l e D, be bounded operators defined 
everywhere in Jf. The operator-valued function A(Q is said to be 
analytic on the open domain D C C 1 if A(£)f is an analytic vector-valued 
function at all £ e Z) for any given / e 

Theorem 6*6* If A(£) is an analytic operator-valued function on 
the open connected domain D in C 1 and A(£) is a completely continuous 
operator for each £ e D, then [1 — ^4(£)] -1 either does not exist at any 
£ e D f or it exists at all £gB with the possible exception of a subset 
, which has no accumulation points in D f in which case [1 — ^4(£)] _1 
is analytic on D — *9^ . 

Proof. Since A(£ 0 ) is completely continuous for any given £ 0 e D, 
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according to Theorem 8.4 in Chapter IV, there is a finite-rank operator 

AN = Z I e k y ^k( e Tc I 

such that || A n — A(£ 0 ) || < On the other hand, the analyticity of 
A(£) at £ 0 implies* the existence of a neighborhood D c of £ 0 in which 
|| A(Q — A(£ o) || < i, so that || A(Q — A N || < 1 on D Cq . Consequently 
(see Chapter IV, Exercise 7.9 and Theorem 8.2), 

(6.103) (1 - A(Q + A n )~' = u-lim f (A(Q - A K f 

~*°° fc=0 

exists as a compact operator for all £ e D , and it obviously defines 
an analytic operator-valued function. Furthermore, since upon intro¬ 
ducing the compact operators 

(6.104) A n (0 = A N (1 - A(0 + A n )~' 

= t I O K<e k I (1 - A(0 + A n )-\ 

Tc= 1 

we can obviously claim the validity of the operator identity 
1 - A(Q = [1 - ^(£)](1 - Atf) + A„), UD Co> 

we conclude that 1 — A(£) has an inverse for £ e D Cq if and only if 
1 — A n (Q has an inverse. 

By (6.104) the operator A N (Q is of finite rank and therefore 
(A n (Q — l) -1 does not exist if and only if there is a nonzero / e for 
which 

/ = A N (Qf = £ X^e t | (1 - A(0 + A n )~V> e/. 

i=l 

Since ,..., e N ' is an orthonormal system, the previous equation is 
equivalent to the following system of linear algebraic equations: 

(6.105) 

<«/ I /> - Z A*<e* | (1 - A(0 + A N )~' e k '}<e k ' | /> = 0, i= 1,..., N. 

Tc= 1 


* The detailed argument requires the generalization of Lemma 3.3 to Banach spaces, 
and then the treatment of A(Q as a function in the C # -algebra viewed as a Banach 

space (see Dunford and Schwartz [1957]). 
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In turn, this system has a nontrivial solution for <( e• | />, i = 1,..., N, 
if and only if the corresponding determinant 

(6.106) 8(0 = det{8 tt - | (1 - A(Q + «*'>} 

vanishes. Since, however, (6.106) is analytic on D^ due to the analyticity 
of (6.103), S(£) cannot vanish on any set in D that has accumulation 
points in D c without vanishing identically on D Cq . Thus the theorem is 
established in D Cq , and by standard methods used in analytic continua¬ 
tions, its validity can be therefore ascertained on all of D. Q.E.D. 

The family i?(£) of Hilbert-Schmidt operators with kernels (6.26) is 
obviously analytic in the upper as well as lower open complex half¬ 
planes, so that Theorem 6.6 is applicable to it. However, in solving (6.5), 
we were interested in the existence of (1 — i?(£)) -1 when Im £ —► ±0, 
rather than for Im £ ^ 0. Hence we need the following variant of 
Theorem 6.6. 

Theorem 6*7 ♦ If A(Q is a compact operator-valued function that 
is analytic whenever Im £ > 0 (or Im £ < 0) and uniformly continuous 
for Im £ ^ 0 (or Im £ ^ 0, respectively), then either (1 — A(Q)^ 1 does 
not exist anywhere on the domain of definition of A(£), or it exists 
everywhere on the real axis with the possible exception of a closed 
set ^ of real values of ^ which has Lebesgue measure zero in U 1 . 

The proof of the Theorem 6.7 proceeds in exactly the same manner 
as that of Theorem 6.6 until (6.106) is reached. Then, in ascertaining 
the properties of the set on which S(£) in (6.106) vanishes, we can 
apply a theorem* from complex analysis to reach the desired conclusion. 

By the argument used in the first part of the proof of Theorem 6.4 
we know that there is a X v > 0 such that (1 — i?(£)) -1 exists if | £ | > X v . 
This tells us, first of all, that the set SPy 1 of real points at which 
(1 — i?(£)) -1 does not exist is bounded, and second, that due to 
Theorem 6.7 it has to be closed and of Lebesgue measure zero. 

Exercises 

64 ♦ Prove that the integral in r' of the function | r' | _2_e o | r — r' | -1 
over the region {r'r | r' | ^ Rj} exists when R ± > 0 and e 0 > 0, and 

# That theorem states that if 8(£) is any function that is analytic in the open upper 
(or lower) complex half-plane and continuous in the closed upper (or lower) complex 
half-plane, then the set of points on the real axis at which 8(£) vanishes is closed 
and of Lebesgue measure zero if S(£) is not identically equal to zero (see, e.g., Lemma 8.21 
in Amrein et al. [1977]). 
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that when r —>■ +oo, it behaves as 0(1 /r e o) if 0 < e 0 ^ 1, and as 0(1 /r) 
if e 0 > 1. 

6.2. Prove that if the potential V(r) obeys the restriction of Lemma 
6.1, then the following integral is continuous in r e R 3 and 


f I V(r')\ 

V I r - r ' I 


dr' < C, 


r e [R 3 , 


where the constant C is independent of r. 


6.3. Show that if the potential V(r ) = 0(r~ 2 ~ e o) satisfies the remaining 
conditions of Lemma 6.1, then as r —> oo 


, |F(r,)U,, (°(i^r) tor 0< - <1 

J " ,|r “ r ' l ‘ |o(^) for 


and that the above integral defines a locally integrable function of r. 

6A* Show that if the potential V(r) = O(r~ 2 ~ e o) satisfies the condi¬ 
tions of Lemma 6.1, then as r 1 —> oo for fixed r e IR 3 , 


V(r 2 )\ 


for 0<,,<1 

dr 2 = ( 1 

O (- ) for e 0 > 1, 


I r — r 21 I r 2 ~ r i I ) n (J_) 

and the above function is locally square integrable in r x and r. 


6*5* Prove that if the potential V(r) satisfies the conditions of 
Lemma 6.1, then the integrals appearing on the right-hand side of 
(6.39) exist, and that F v ( r; £) is locally square integrable and bounded 
at infinity in r whenever J(£) ^ 0. 


6*6* Show that if the potential V(r) is locally square integrable, 
and | V(r)\ < C 0 r~ 2-e o, e 0 > 0, for r > R 0 , then the function 
V(r) | r — r' |“ 2 V{r') is integrable on IR 6 if it is integrable on the set 
{(r, r'): r < R 0 , r' < 2? 0 }. 

6*1* Prove that when the potential V(r) satisfies the conditions of 
Lemma 6.1 then the function v k (r) defined by (6.55) is uniformly 
bounded (i.e., | ^ k (r)| ^ const) for r ^ R 0 and k assuming values from 
a compact set D 0 containing no points for which k 2 /2m e \ . 
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6*8* Let h(r,u) be a function with a continuous (in r and u) 
partial derivative h u ( r, u) = dh( r, u)/du , and assume that for r > R 0 

I u)f(v) | < p(r), 

where p(r) is integrable on {r: r ^ 7? 0 }. Show that if h u ( r, w)/(r) and 
/(r) are integrable over the sphere {r: r ^ 7? 0 }, then 

^ = J R s ")/w 

6*9* Show that the function 

^ r) =j «*pw,-ri ) /(fVr , 

is continuous in r if/( r) is locally square integrable and integrable on IR 3 . 

6*10* Show that the family *£y of all functions in ^(IR 2 ) with 
supports disjoint from the set \ (mentioned in Theorem 6.1) is dense 
in L 2 (IR 3 ). 


7♦ Wave and Scattering Operators 
in Potential Scattering 

7.1. The Existence of Strong Asymptotic States 

In the study of a particular scattering model it is crucial to determine 
from the start the magnitude of the family of strong asymptotic states. In 
other words, we have to establish the size of the initial domain M 0 of 
Q ± . 

In potential scattering, under very mild restrictions imposed on the 
potential, this question is answered by Theorem 7.1. 


Theorem 7*1 ♦ Suppose the potential V(r) is locally square inte¬ 
grable and that for some fixed e > 0 

r = |r|-oo. 


(7.1) 



570 


V. Quantum Mechanical Scattering Theory 


Then the initial domain M 0 of the wave operators Q ± corresponding 
to H 0 D — (l/2m)A and H 2 —(l 12m)A + V(r) coincides with the 
entire Hilbert space L 2 (M 3 ). 

The proof of this theorem is based on Theorem 2.8, and it will be 
carried out in several stages. It is important, however, to verify before 
starting with the proof that in the present case H is indeed self-adjoint. 
This self-adjointness of H is a consequence of the restrictions imposed 
on V(r), which has to be locally square integrable and bounded at 
infinity to satisfy these restrictions. Consequently, — (l/2m)A + V(r) 
determines a unique self-adjoint operator by virtue of Theorem 7.5 in 
Chapter IV. 

The starting point of the proof of the above theorem consists in 
choosing an adequate domain which satisfies the conditions of 
Theorem 2.8. We shall prove now that we can choose for 3) 1 the linear 
manifold spanned by all functions 

(7.2) $ p (k) = exp[—(l/ 2 m) k 2 — ikp], p e M z , 


where p varies over all vectors in (R 3 . 

As a straightforward consequence of (5.2) we obtain 

(7.3) (exp( —itH 0 )i/f p ) ~ (k) 

= exp(— 

= exp ( ~\m lt k2 -*e)' 

Hence, by virtue of Theorems 7.3 and 7.5 in Chapter IV we have* 
exp (~itH 0 ) </r p e n Of H , and consequently exp(-itH 0 ) £$ 1 C 2 n 3 > H . 

The second condition imposed on in Theorem 2.8 is that when 
t x -> — oo and t 2 -> +oo, the strong limits of , t 2 ) 0 p exist. We 

have already pointed out, at the end of §2, that the existence of the 
integrals (2.41) and (2.42) is sufficient for the fulfillment of this condition. 
In view of this fact, we shall settle the question by showing that 


(7.4) r\\Ve-™'4>,\\dt<+oo 

J _oo 

for all p e IR 3 . 


Note that Tg = H 0 and T s + V$ = H. 
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By taking the Fourier inverse transform of both sides of the relation 
(7.3), we obtain for any real 8 

I ( ex P( —itH 0 )tfi „){ r ) I 

= (2tt)- 3/2 f„(k) exp [— it + ikrj dk | 

= (2tt)- 3/2 J] t ex P [-V + ik JP ~ /»»)] d K 


we{x,y,z} 


(i + «r 2 


n =p[-xW] 




m 


3/2 


exp 


t- 


m (r — p) 2 


1 + it | 3 ' 2 

= ffi 3/21 r ~ P |- 1/2+ * I * 


2 1 + < 2 

1/2-6 


■] 


1 + it | 1+s I 1 + it 


exp 


m (r — p ) 2 I 

2 1 + t 2 J ' 


If we now take 0 < 8 < 1/2, so that 1/2 — S >0, we get, by multi¬ 
plying both sides of the above inequality by F(r), 


(7.5) 

\{V 

| F(r)| | r - p |*-i/ 2 


< m 3/2 ■ 


(1 + *Y) 1+S 


(r — p) 2 !/ 4 - 

1 + t 2 


■ 6/2 


exp 


m (r — p) 2 
~2 1 +> 


I r — o I 5 " 1 / 2 

< C - | F(r)|, 


1 + it | x + 5 

where C is a constant which majorizes the function 


m 3/2 


(r - p) 2 
1 + t 2 


1/4-6/2 

exp 


m (r — p) 2 1 

2 1 + t* J 


for all values of r, p, and t. If we choose 8 so that in addition 8 < e, 
then by virtue of (7.1) the function on the right-hand side of (7.5) is 
square integrable. Consequently, we can write 


\\Ve- itH »* p \\ <C P 


1 

| 1 + it I 1 * 5 ’ 


where C p is constant with respect to the variable t e R 1 . Thus, (7.4) 
holds and the second condition of Theorem 2.8 is fulfilled. 

This establishes that the closure of the linear manifold spanned 
by all functions (7.2) is contained in M 0 . 
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To conclude the proof of Theorem 7.1, we still have to show that 
= L 2 (IR 3 ), and thus establish that M 0 = L 2 (IR 3 ). To arrive at this 
result we need the following lemma, which is a straightforward general¬ 
ization, to arbitrarily many dimensions, of a theorem by Wiener. 


*Lcmma 7*1 ♦ Let us denote by vf'} the set on which the Lebesgue 
square-integrable function f(x ), x e U n } vanishes. The Lebesgue 
measure of is zero if and only if the linear manifold (e^y) spanned by 
the family of all functions exp \ip • #]/(#), obtained by varying p 

over all vectors p e IR n , is dense in L 2 ((R n ). 


Proof. We set out to prove that, given any ^eL^IR 71 ) and e > 0, 
we can find vectors p lt ...,p 8 eU n and complex numbers a x ,..., a s such 
that 


(7.6) 


f , d*) -/(*) i a/—* 

J D5 n V =1 


2 

d n x < 16c. 


We note first that for any e > 0 we can satisfy the inequality 


(7.7) 


| £(*)| 2 d n x <e, B' =R n - B, 

B' 


with a compact Borel set B, by choosing that set sufficiently large. 
Let us define the functions 


h M (x) = 


g(x) 

/(*) 

\M-^- 


l/(*)l 


I £(*)l /(*) 


if 

if 


g(x) 

/(*) 

gj*) 

/(*) 


< M 
> M 


for any positive integer M\ note that h M {x) is almost everywhere well 
defined, since f{x) vanishes only on a set of measure zero. 

It is easy to see that by virtue of Theorem 3.10 in Chapter II 


lim f | h M (x)f(x) - £(*)| 2 d n x = 0. 

M-»°o J 


Consequently, we can find an M for which 


(7.8) 


f I h M (x)f(x) -g(x)\*d n x < e. 

J r" 
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Since (7.7) obviously implies that 

f I M *)/(*)| 2 d n X < £, 

J B' 

we easily obtain from this inequality and (7.8), by using in the process 
the triangle inequality in L 2 (R n ), 

(7.9) f | g(x) - XB (x) h M (x)f(x )| 2 d*x < 4e. 

J R n 


Let L be a box 


L — ^ x$ ^ /, t — 1,..., w) 

which contains B f and let g t (x) be a function equal to X b( x ) h m { x ) i n L 
and periodic in each of the variables x x x n , with the period 21. For 
sufficiently large l we have 

f n I Xb( x ) h M (x)f(x) - g l (x)f(x)\ 2 d n x 
J u n 

+?/ r 

= E I Xb(») M*)/(*i + 2lk i .*« + 2lk„)f 2 d n x 

fcj.fc n =-°° J L 

<M 2 f \f(x)\ 2 d n X < €, 

J l! 

where L f = IR n — L, and the prime on the summation sign indicates 
that the sum does not include the term with k x = • • • = k n = 0. Com¬ 
bining (7.9) and the above inequality we obtain 

(7.10) f I g( x ) — gi( x )f( x )\ 2 dx < 9e 

J B5 n 

by making use of the triangle inequality. 

Let us expand g^x) in a Fourier series, 


gi(*) = 




k • x\ 
l F 


(2iy 


f gi(*) ex P (~ i7T ^~y) dnx > 


where k = {k t k n ) and the convergence of the above series is in 
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the mean. A straightforward computation yields, for any positive 
integers N and iV 0 < N , 



where the summation extends over all k = (k x k n ) in which 
| kj | > N 0 for at least one value of j = 1,..., n. Due to the convergence 
of X I c u l 2 > the second sum on the right-hand side of the above inequality 
can be made arbitrarily small by choosing N 0 large enough, while the 
first sum can be then made arbitrarily small by taking N sufficiently 
larger than iV 0 . Hence, the sequence of functions 


/„(*) -«,(*) - ^ - tt) - (‘ 


I *. I \ 
N ) 


C k exp 


k • x\ 

i r 


N = l, 2,. 


converges in the mean to zero when N —*■ -j-oo. Thus, it follows from 
Theorem 4.5 of Chapter II that there is a subsequence f Ni (x), 
which converges to zero for almost all values of * e [R m . 

Now we can apply Lemma 3.1 of Chapter IV to infer that 


(7.11) lim f J/ Nr (*)/(*)| 2 ^ = 0, 

after noting that the use of this lemma is justified by virtue of the 
inequality 


I/nW/WI<(I^WI+M}|/wi^ 

In fact, the above inequality follows from the estimate 


ftp* • *»ftn'— W 




((equation continues) 
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(>--*^4 “■’(*' 


k * (x — x') 

1 


)j d n x' 



kA 


N 


j exp 


kj(Xj Xj ) 
/ 


] d n x' 




the estimation of the last integral in the above relation could be carried 
out due to the formula 


' / 

11- 

kj=—N 



sin 2 [AT7r(^ — x/)l2l] 
sin 2 [7r(^. - x/)l2l] ’ 


which can be proven by mathematical induction in N. 

Since (7.11) is true, we can choose one N r = N' for which 


(7.12) 


f „ U(*)/(*)-/(*) z ( i 

J m" I kl . k„—N' ' 


lAi'i... 

AT / 


X 


(‘ 


^r-) c * ex p 


^t7T 


k * X 

~T~ 


2 

d n x < e. 


Combining (7.10) and (7.12), we immediately obtain, by means of 
the triangle inequality, 

g(*)-m i 

k x ,...,k n — N’ ' iy 

••• (i — c * ex p (*"■ ~i * ) | dr>x < i6e - 

Since the above inequality becomes identical to (7.6) after an appropriate 
change of symbols, we have proven the statement that the linear manifold 
(«^y) spanned by is dense in L 2 (R S ). 

Conversely, if (J*y) is dense inL 2 (lR 3 ), then the measure of is zero. 
In fact, if this were not so, then by choosing g(x) = Xje f ( x ) we would 
have 




J » 


-f( x ) Z a »e' 

v=l 




d n x > C I £(*)| 2 > 0, 

J Jff 


so that (7.6) would not be satisfied for all € > 0 by any a x ,..., a 8 e C 1 
and p x ,..., p 8 e Q.E.D. 

If f(p) is the Fourier-Plancherel transform of/(#), then it is easy to 
see that the Fourier-Plancherel transform of f(x)e ip v' x is f(p— p v ). 
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The function f(p — p^ is usually called the translation of /( p) by 
the amount p v . 

By taking the Fourier-Plancherel transform of the integrand in (7.6) 
we obtain 

f . Ik*) - i *,f(P~P,) 2 d”p < 16e. 

J u n I v=l 

Hence, we conclude that the linear manifold spanned by all translations 
of f(p) is dense in L 2 (U n ). This statement constitutes Wiener 9 s theorem 
on the closure of translations. 

Let us return now to the proof of Theorem 7.1. If we set 

/(k) = «p 

then (7.2) assumes the form 

to =/(k) *-*■>. 

Since /(k) is different from zero everywhere on R 3 , we deduce from 
Lemma 7.1 that the linear manifold spanned by all function (7.2) 
is dense in L 2 (R 3 ). Consequently, == L 2 (IR 3 ), and since we already 
know that C M 0 , we finally obtain M 0 = L 2 (IR 3 ). 

This concludes the proof of Theorem 7.1. We note that this theorem 
guarantees that M 0 == L 2 (R 3 ) for all almost everywhere continuous 
potentials F(r) which decrease at infinity faster than the Coulomb 
potential. However, at this stage we cannot say anything about the 
unitarity of the S operator in potential scattering. Only when additional 
restrictions on the potential are imposed, as required in the theorems 
of the next section, will we be able to prove that in potential scattering 
the scattering operator S is unitary. 

7.2. The Completeness of the Moller Wave Operators 

The next theorem will relate the Moller wave operators to distorted 
plane waves, providing at the same time the answers to all the basic 
questions regarding R ± , M 0 , and the unitarity of the scattering operator 
S in potential scattering. 

Theorem 7*2* Suppose the potential V{r) is locally square in¬ 
tegrate and of Rollnik class, and that 

(7.13) l^ (r) l = o(7^-). '-ob. 



7. The Formalism of Wave Mechanics 


577 


for some € 0 > 0. Then the initial domain M 0 of the Moller wave operators 
Q ± is L 2 (R 3 ), Q ± are complete [i.e., the final domains R ± of Q ± are both 
identical to L% (R 3 ) = £' H (S^)L 2 (IR 3 )] and the S operator is unitary on 
M 0 =L 2 (IR 3 ); furthermore, 

(7.14) (0±0X*) = f ^(r)^) dk, 
where $(k) is the Fourier-Plancherel transform of i/r(r), 

(7.15) $(k) = (2tt)" 3 / 2 l.i.m. f 0(r) e** dr , 

and ^ +) (r) are the outgoing distorted plane waves, while $[ _) (r) = 
0 (+)*( r ) are the incoming distorted plane waves given as solutions (6.16) 
of (6.5). 

The validity of statements which are essentially equivalent to the 
assertions made in the above theorem about M 0 , R ± , *S, and Q ± has 
been taken for granted by physicists from the very beginning of quantum 
scattering theory. In proving the above theorem, we confirm that this 
faith was justified only to the extent that the potential satisfies the 
conditions imposed on it at the beginning of the theorem. Consequently, 
it is important to realize that these conditions are essential only to the 
extent of making applicable Theorems 6.1 and 6.2 on distorted plane 
waves. To avoid getting into a maze of technical details we have opted 
for (7.13) from the beginning of our systematic study of in § 6 , 

but in fact (7.13) can be relaxed to 

(7.16) | V(r)\ = 0(r- 2 " e ), € > 0. 

However, in that case one has to base (6.7)-(6.9) on factorizations 
yz | y 11-0 w ith ft G (k that, in general, might* not equal J. Various 
other approaches exist (such as those of Ikebe [1960] based on Banach 
space techniques) that can achieve essentially the same result, but 
clearly, as demonstrated by a counterexample of Pearson [1975], the 
previous unitarity of the S operator does not hold for arbitrary short- 
range potentials. 

In case the potential is spherically symmetric, i.e., V(r) — V 0 (r), 
the above conditions can be further relaxed by replacing (7.16) with 

(7-17) IWI=0(^)> e >°- 


* See, e.g., Amrein et al, [1977], pp. 372-373, or Reed and Simon [1979], p. 353. 
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and making the additional requirement that r%(r) is locally integrable 
on [0, +oo) (see Green and Lanford [I960]). Since the assertions of 
Theorem 7.2 are not true for the Coulomb potential, it is clear that for 
spherically symmetric potentials one cannot hope to do any better by 
further relaxing (7.17). However, in the general case, it is an open 
question whether one could relax (7.16) any further. 

We shall prove Theorem 7.2 in a few stages. The first step consists 
in verifying that the integral operator in the right-hand side of the 
equation (7.14) is a well-defined mathematical entity. 

*Lemma 7*2* If 0j^= ) (r) are the solutions (6.16) of (6.5) with a 
potential satisfying the conditions of Theorem 7.2, then there are 
unique operators W + and W_ defined by 

(7.18) (W^)(r) = l.i.m f <pW(r)&(k)A 

for arbitrary sequences $ 2 (&),..., e converging in the mean to 

f eL 2 (R 3 ), and these operators satisfy the relations 

(7.19) W ± W ± * = E H ( Sf), 

(7.20) W ± *E h (B) = E h \B)W ±*, B e 

Proof. We shall prove the lemma only for W+ , since the cor¬ 
responding results for W_ easily follow from the relation ^k _) ( r ) = 
0LV*(r). 

First of all, we note that under the present restrictions on the potential, 
Theorem 6.3 holds. It follows that for any given bounded closed set B 
which is disjoint from the exceptional set defined in (6.15) 

we can find a locally square integrable and bounded at infinity function 
C 5 (r) such that 

(7.21) | ^ +) (r)| = |(27r) _3/2 [e* kr + *> k (r)]| < C B ( r) 

as long as k assumes only values from B . 

Let us consider the bilinear form 

(7.22) (0lf)= f ,f*(k)^(k)* 

J R 3 

in if/ and ifj\ where 

(7.23) 


#(k) = (2 tt)- 3 / 2 f e- i] “if>( r) dr , 
J R 3 
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(7.24) &'(k)= f y^*(r)^(r)dr. 

J R 3 

With the help of Theorem 6.2 we show that this bilinear form is bounded: 

1(0 I 0')l < t J r3 1 ^ (k)|2 rfk fl 1 ^ +,(k)|2 dk \ 12 

= H\\\\e«(S«w\\^w\\w'\\. 

Consequently (see Chapter III, Exercise 2.5), there is a unique bounded 
linear operator W + satisfying the relation 

(7.25) <W^ |f> =(0|f) 
for all eL 2 (R 3 ). 

Let us choose now $(k )e ( £y and 0'( r ) G ^b (R 2 ), where is the 
family of all functions in ^(R 2 ) with supports disjoint from c^ 3 . For 
such k), “l.i.m.” in (7.18) can be dropped, and (7.22) can be written 
in the form 

(7.26) (0 | 0') = J K /k0*(k) j 0< + >*(r) 0'(r) dr\. 

Since (7.21) is satisfied by 0 k (r), we immediately find that for such 
choices of ^'(r) an d *A(k) the function $*(k)0£ +) *(r)0'(r) is integrable 
in R 6 . Consequently, by Fubini’s theorem, we can interchange the 
order of integration in (7.26), thus arriving at the result 

(7.27) < W + .p | 0') = J dr 0'(r) j J ®l + \r) 0(k) dk j *. 

Since ^^(R 2 ) is dense in L 2 (R 2 ), equality (7.27) holds for all i/r'(r) E 
^(R 2 ) if and only if 

(7.28) (W + i[>)(r) = f <P< +, (r) f(k) dk 

J R 3 

almost everywhere. Hence, (7.18) is established for ifie&y . Moreover, 
due to the fact that tfy is dense inL 2 (R 3 ) (see Exercise 6.10) the general 
validity of (7.18) for W + and arbitrary if/eL 2 (R 2 ) is an immediate 
consequence of (7.28). 

In order to express W + * as an integral operator, let us take in (7.22) 
\fj(r) e ^(R 2 ) and write for compact B n -> R 2 — 
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Since, under the present conditions, the function 0*(r) e ikr 0 + '(k) is 
integrable in r and k on U 3 x B n , we can invoke Fubini’s theorem to 
invert the order of integration in k and r: 

<0 | W + *if>'y = (27t)- 3 ' 2 lim f dr 0*(r) j f e* kr # + '(k) dk\ . 

^ J R3 U Bn ) 

Now, the type of reasoning earlier applied to (7.27) leads us to the 
conclusion that 

(7.29) (FF+YXr) = (2w)-*« l.i.m. f e ik '$ + {k) dk 

J R 3 

for arbitrary $+(k) given by (6.45). 

Combining (6.48), (7.18) and (7.29), we obtain 

(W + W + *4,){ r) = \W + {W^)\{r) = l.i.m. f <J^ +) (r )(W + *^)~ (k) dk 

J R 3 

= l.i.m. f #< +) (r)&(k)</k = (E h ( S^)(r), 

J R 3 

where the last step follows from Theorem 6.2, and in particular from 
(6.44). Hence (7.19) is established. 

Finally, combining (6.44)-(6.46) with (7.29), we easily derive 

(W+*E H (B)il>){v) = (2tt)- 3 ' 2 l.i.m. f e* kr (F*(£)0r (k) </k 

J R 3 

= (2tt)- 3 / 2 l.i.m. f e ikr $+(k) dk, 

^ k a /2rnej 

FF + *0)(r) = (2 t 7)- 3/2 l.i.m. f e ikt (W + *i/>)~ (k) dk 

^ k a /2 meB 

= l.i.m. f e ikr $ + (k) dk. 

J k a /2 meB 


This establishes (7.20). Q.E.D. 


*73* Proof of Asymptotic Completeness 

We are now ready to carry through the proof of Theorem 7.2 to its 
completion by showing that W ± = Q ± . 

Let us adapt the relation (2.37) to the present case, setting t x = 0, 
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t 2 = t, and / = ip to obtain 

(7.30) <(%) - 1)0 | g> = i f (<«-"•* 0 | He~ im g y - 0 | *-"**» dt 

for any g e . 

Since the restrictions imposed at present on the potential meet the 
demands of Theorem 7.5 in Chapter IV, we infer that if iff e S) H , then 
ifs e @ v . Hence, for iff e , (H — H 0 )e~ iHot iff — Ve~ iHQt iffy and the 
relation (7.30) assumes the form 

(7.31) <^(r)0 \g> = <iff\g>+i f ie im Ve- iH ^ iff \ g y dt. 

J o 

The above relation can be extended to arbitrary £eL 2 (IR 3 ). Setting 
g — W_<f> 9 <f>eL 2 (R 3 )y and using the relation W_*e iHt = e iH <* 1 W_* 9 
which is a direct consequence of (7.20) (see also Exercise 2.1), we obtain 

<<f> I W_*a(r)iffy = <<£ I W_*ifi) - i f <<£ I e**'* W^Ve-**'* iff > dt. 

J 0 

Since it can be readily shown (see Exercise 7.1) that 

(7.32) f <0 I e iH °‘W^Ve-W 0> dt 
J 0 

/•+ 00 

= lim J e“ E< <0 | e ia,,t W_ * Ve~ iH<>t 0> dt, 

we easily arrive at the result 

(7.33) 

<0 | = Jim <W_<f> | Q( r)0> 

= <0 | W_*4>y - lim i f + ” c“ e< <0 I e^Wj^Ve-^ 0> dt. 

f-»+0 J 0 

It should be emphasized that, in this context, all the integrals over the 
infinite interval [0, + oo) are improper Riemann integrals, and not 
Lebesgue integrals. 

We can calculate the integrand in the above relation by using (7.29) 
and (4.8), 

(7.34) <«£ | e iH ^W^Ve~ iH ^ iff} = f <£*(k) iff )~ (k) dk 

J nj8 


((equation continues) 
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= f <£*(k) e i(k 2 /2m)(( Fe -W 0 « (k) dk. 

J R 3 

We resort now to the relation (6.45), which for 0(r) e ^(R 3 ) yields 

(7.35) (Fe- <H «V):(k) = f *(r) F(r)(e- </r »V)(rWr. 

J R 3 

We note that the limit in the mean in (6.45) has been dropped in the 
above relation, since the integrand in (7.35) is integrable on U 3 . To 
arrive at this conclusion, we recall (7.21) and note that (e~ iHot ifs)(r) 
is uniformly bounded for all t e U 1 (see Exercise 7.2). 

By combining (7.34) and (7.35) we obtain 

(7.36) <# | exp (iH 0 t) W_*V exp 

= f dk <£*(k) f dr 0<- ) *(r)F(r)(exp[-i(H o - k 2 /2m)<].A)(r). 

J R 3 * R 3 

Let us show now that when ifs(r) e ^(R 3 ) and B is compact and 
disjoint from \ we have 


(7.37) 


f a ^- ) *(r)F(r)(exp[-t(F 0 - k 2 /2m)^)(r) dr 

J R 3 


< c 09 


where C 0 is a constant independent of k e B and t e M 1 . Indeed, since 
(exp[— i(H 0 — k 2 l2m)t]iff)(r) is uniformly bounded in r, ke U z and 
t e U 1 (see Exercise 7.2), by using (7.21) we get 

f I ®i _) *(r) F(r)(exp[—«(H 0 - k 2 /2*«)/]^)(r)| dr < const f V(r) C B (r) dr, 
J M 3 V 


thus establishing the validity of (7.37). 

Let us insert (7.36) in (7.33) and choose <£(k) e ^(R 3 ), thus obtaining 

(7.38) <<A | W_*Q_+> = <<A | W_*y> - *), 

where 


(7.39) /,(<£, $) = / f°° dt er<* f dk$*(k) 

J 0 R 3 

X f dr >*(r) F(r)(exp[-t(H 0 - k 2 /2ffi)<]^)(r). 

J R 3 
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Using (7.37), we easily infer that if supp <J> C JB the above integrand is 
integrable in ie [0, + oo) and k e R 3 . Consequently, the reversal of 
the order of integration in t and k is justified by Fubini’s theorem. 
Hence, we have 

(7.40) 

dk<j>*(k) f dt(F»\ exp [~i(H 0 - k 2 /2m - u)W>, 

05 3 J 0 


(7.41) Ff(r) = Xil (k)<5 ( -)(r) V(r). 

Using Theorem 3.1 of Chapter IV, we infer that 

-*)<]*) 

= ' * (** I ‘ xp [ _ ‘ ( H ° - m ~ ") '] (i nr + “) *)• 

Consequently, we easily compute 

- ■' “a <« IK [-' («•-£- •'*) *]-■)*• (£-■+») *) 

= -'<* I ■*• dr + “) +) - -'(*• (4- - “W *)• 

If we combine (5.5) and (5.6) with the above result, we get 
TU M _ m f ( f exp[*(k 2 - 2we) 1/2 | r - r' |] 

X F(r') 0<->(r') dr' J*<£*(k) 0(r) rfr rfk. 

Inserting this result in (7.38), and noting that 


(7 42) fa k w. - ^' r - ° y w w 

- L r- “ W w 
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and that the absolute value of the integrand in the above limit is 
majorized by | r — r' | _1 | F(r')| C B (r') when k e B, we arrive at the 
following result: 


(7.43) <# | = (W_^> | </>> - 


m r j r exp(— ik [ r — r' |) 

277 J os 6 ( J r 3 | r r | 


X V(r') ^- ) (r / ) dr' j*<£*(k) ifi(r) dr dk. 


From (7.18) we obtain 


(7.44) 


<W-+ l*> = f dr i/i(r) f ik0‘->*(r)£*(k) 

J H5 3 j (r 3 

= f <P<“>*(r)^*(k) >P(r)drdk, 

J ira® 


where in view of (7.21) the second equality follows from Fubini’s 
theorem. 

Substituting the expression for | */r> from (7.44) in (7.43), 

we obtain 


i<f> | = j 3 dk$*(k) J 3 dr #r) \<t>i~\r) - ~ 

= dk$*(k) f dr e- ikr 0(r) = f dk#*(k) fi(k) 

J J J 


by using in the process the Lippmann-Schwinger equation (6.5) for 

$><-> (r). 

In the above relation iff(r) e ^(IR 3 ) and <£(k) e ^y . Since ^(IR 3 ) is 
dense in L 2 (IR 3 ), it follows that W_*Q_ = 1. Consequently, by virtue 
of (7.19) and the fact that E R ^ ^ H (Sc)> we conclude that 

W_ = W_W_*Q_ = E H ( Sf) Q_=Q_. 

Completely analogous considerations lead to the conclusion that 
W + — Q + . Hence, we have thus established the representation (7.14) 
for the wave operators Q ± in potential two-body scattering for arbitrary 
i/j eL 2 (R 3 ). This shows that in the present case M 0 == L 2 (IR 3 ). Moreover, 
in view of (7.19), we have also established that R + = R_ == Ljy o ((R 8 ). 
Consequently, the proof of Theorem 7.2 has been completed. 
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7.4. Phase Shifts for Scattering in Central Potentials 

We had mentioned in §7.8 of Chapter IV that {| P |, | L |, L {z) } is a 
complete set of observables next in importance to the position and 
momentum complete sets of observables. In §4.3 we had briefly outlined 
how the stationary scattering theory can be formulated in terms of 
partial (or spherical) waves constituting an eigenfunction expansion for 
(| P |, | L |, L {z) }. We now propose to prove by a very general argument 
that indeed for spherically symmetric potentials 

(7.45) V(r) = V 0 (r) y r = | r |, 

the S operator can be expressed in terms of the phase shifts S^k) men¬ 
tioned in §4.3. 

We already know that H 0 in (4.14) or (5.1) and (5.2) commutes with 
(| P |, | L |, L {z) }. Obviously, the interaction Hamiltonian H 1 = V 
does not commute in general with | P |, but it does commute with | L | 
and L {z) (in the sense of Definition 1.2 in Chapter IV) when (7.45) is 
satisfied. Indeed, in that case V = V Q (\ Q |), and by Theorems 2.5 
and 2.6 in Chapter IV, all we have to prove is that the spectral measure 
EM(JB) of | Q |, 

(7.46) (E\o I(£)0)(r, 0, <f>) = Xs (r) 0(r, 0, <£), 

commutes with the spectral measures of | L | and U z) —a fact that is 
implicit in the relation (7.43) of Chapter IV. 

Since H 0 and V do not commute, we cannot argue that | L | and L (z) 
commute with H = H 0 + V (in the sense of their spectral measures 
commuting) simply because they commute with H 0 and V. This result, 
however, can be deduced from the following theorem by Trotter 
[1959], which finds many other applications in quantum mechanics. 

*Lcmma 73* If A x and A 2 are self-adjoint operators and A x + A 2 
is self-adjoint on Of — S) Ai n Of , then Trotter's product formula 

(7.47) eUAi+Ajt = e iA *^ n ) n 

n-> oo 

holds true for any t e U 1 . 

Proof. By Theorem 3.1 in Chapter IV and Exercise 5.6 in Chapter 
III we have for any / e Sf y 

s-lim r~ 1 (e xA i T e lA 2 T f — f ) = s-lim e lA \ T r~ 1 (e tA 2 T f — f ) 

+ s-lim r-\e iA x^f — f) 

(i equation continues) 
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— K^if + ^ 2 /) 

= s-lim T _ 1 (e* u i + ^2 )T / —/). 

Consequently, if we set 

M(t) = r~ 1 (e i(A i +a 2 )t — e iA i T e iA 2 T ), 

then || M{r)f\\ —> 0 as t —> 0 for all fe Of. 

To prove that as t— >0, || M(r)f\\ converges to zero uniformly on a 
dense set of vectors/ e®, we shall employ Lemma 2.2 from Chapter III. 
Hence we introduce in Of the new inner product 

(7.48) </1 = ((A, + A 2 )f | (A, + A 2 )g> + </1 ;>, 

and we note that since A x + A 2 is self-adjoint (and therefore closed) 
on Sf y the linear set Of is a Hilbert space with respect to this inner 
product. Furthermore, the linear functionals 

+rM) = »!%)/>. \\g\\® = 1, TG[-1, +1], 

are bounded in /, g e Q) (with inner product <• | -)^), so that by Lemma 
2.2 in Chapter III (which can be easily extended from sequences to 
arbitrary sets of linear functionals), we conclude that 

\+rM)\<Cl> \\f\\® = \\g\\®=h TG[-1,+1]. 

Since obviously a similar constant C 2 can be found if t e U 1 — [ — 1, -f-1], 
we can state that 

l|M(r)/|| <C||/b, fe&, teR\ 

so that in accordance with (1.16) in Lemma 1.1, || M(t)/|| —> 0 uniformly 
in/e^, \\f\\a = 1, as t-»0. 

To apply this result we write 

||{expD'(A + A 2 )tjn] n }f — [expiiA^jn) exp(iA 2 tjn)] n f\\ 

I n —1 

Y [exp (iA^/n) exp (iA 2 t/n)] j 

j=o 


X {expp^j + A 2 )tjn ] — exp (iA^/n) exp (iA 2 t/n)} 
X {exp[*‘(^i + AJt/n ]}*-*- 1 /1 


(i equation continues) 
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<|*| sup \\(tln)~ 1 {exp[i(A 1 + A 2 )t/n] — expexp (iA 2 t/n)} 
lw|<UI 

X expOX^! + 4>]/ll, 

and note that as n —> oo (and therefore tjn —> 0), the last of the above 
norms converges to zero uniformly in u e [— | 1 1, + | t |] since 
|| exp p(^4i + A 2 )u] f\y = \\f\y for any fixed fe3. Consequently, 
(7.47) is established when applied to fe3. 

Since 3 is dense in and the operator on the left-hand side of 
(7.47) is bounded and defined everywhere in ^ the extension principle 
can be invoked to immediately extend this result to all f^^F. Q.E.D. 

Since by Theorems 7.3-7.5 in Chapter IV we have that 3 H — 3 Hq C 
3 V , Lemma 7.3 can be applied if V(r) is locally square integrable and 
bounded at infinity, so that 

(7.49) exp (iHtj) exp (iL {z) t 2 ) = s-lim[expexp (iVtJn)]' 1 exp (iL {z) t 2 ) 

= s-lim exp(z‘L (2) * 2 )[exp(/i/ 0 * 1 /«) exp (iVtJn)] 71 

tt ->00 

= exp (iL iz) t 2 ) exp 

and similarly for | L |. Hence by Theorem 6.2 in Chapter IV, we conclude 
that | L | and L {z) commute with H. 

Theorem 73* If V(r) = 0(r -1-e ), r —> oo, is spherically sym¬ 
metric and locally square integrable, then the S operator is a function 
of {P, | L |, L (z) ), 

(7.50) S = F s (| P |, L', LW), | L | = [L\L f + l)]i/» 

(7.51) Fs(k, /, m) = exp[2zS^)], 

where S^k) are unique modulo tt real functions that do not depend on m. 

Proof. Under the present conditions on V(r), our earlier considera¬ 
tions leading to (7.50) are valid, so that | L | and L (z) commute with 
both H 0 and H. Consequently, | L | and L iz) commute also with the 
M oiler wave operators, since, e.g., 

E l(z \b)Q ± = E l(i! \b) s-lim e iHt e~ iH ^ 

= s-lim e im e-^E^fB) 

t -*^ 00 

= Q ± E i( '\b) 


(7.52) 
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for any B e 3S 1 . Since S = Q_*Q + , we also have 

(7.53) [. E & \b), S] = [E lu (B), 5] = 0, Be 

On the other hand, since | P | = (2m J ff 0 ) 1 / 2 , we have by Theorem 2.3 
in Chapter IV, 

(7.54) £ |p| (£) = E h °(F~\B)), F( A) = (2mA) 1 / 2 , 

so that according to Theorem 2.3, | P | also commutes with S. Therefore, 
by Theorem 5.6 in Chapter IV the S operator must be a function of the 
complete set {| P |, | L |, L iz) }, i.e., (7.51) is satisfied. 

To show that this function is of the form 

(7.55) F s (k , /, m) = exp[2 i 8(k, /, m)\, S(k, /, m) = S*(A, /, m ), 

we note that by Theorem 7.1, M 0 = L 2 (R 3 ), and that [as we had re¬ 
marked when discussing (7.17) in §7.2], R + — R_ , so that by Theorem 
2.5 S is unitary on L 2 (R 3 ). Consequently, by Theorems 2.6 and 2.7 
in Chapter IV, 

(7.56) F s *(k,l,m)Fs(k,l,tn) = 1 

almost everywhere on R 3 with respect to the measure ^, 0 introduced in 
§4.3. Thus, (7.55) must be satisfied, the functions 8(k, /, m) being 
obviously ^ 0 -almost everywhere unique modulo tt. 

The fact that 8(k, /, m) does not depend on m, and that therefore 
(7.55) reduces to (7.51), follows from a general group-theoretical 
theorem*. Q.E.D. 

We note that Theorem 7.3 establishes the existence of the phase 
shifts Sj(&) without any reference to distorted spherical waves. Therefore, 
as yet we have not proven that the functions S z (&) appearing in (7.51) 
coincide with those in (4.36). However, this result follows from the 
further considerations of §7.6. In the meantime, we shall establish in 
§7.5 that (7.50) and (7.51) have a generalization of sorts valid even 
when V(r) is not central. 

7.5. The General Phase-Shift Formula for the Scattering 
Operator 

Let us introduce in the space , ^ s ) defined in §7 of Chapter II 

[see (7.16) of Chapter II] operators analogous to (3.50) [c.f. also (4.97) 

* The Wigner-Eckart theorem (see, e.g., Miller [1972], Section 7.10). 
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and (6.67)]: 

(7.57) S(k) = 1 - 27riT(k), k e [0, oo), 

(7.58) (T(k)u)(<a) = f / (to ')(w) m(oj') d<a', u eL\Q s , ^ a ). 

K^ w ) J n s 

The unitarity of the S operator imposes the condition that S(k) 
should be unitary in L 2 (Q S , pu s ) for almost all k ^ 0: 

(7.59) S*(k) S(k) = S(k) S*(k) = 1. 

As a matter of fact, for arbitrary u e L 2 (Q S , pu s ) and p e L 2 (Q V , p, v ) 
[see (7.16) of Chapter II] it easily follows from (4.96), (6.67), and 
(7.58) that for (p • u)(k , oj) = p(k) u(io) 

(7.60) ( S*S P • «)(*, <*) = p(k)(S*(k) S(k)u)(t*). 

Consequently, we have for arbitrary u, v e L 2 (Q S , p, 8 ) 


f°° | P (k )\ 2 k 2 dk f v*(io)(S*(k) S(k)u)(io) da> 
J o J n s 

= f {p • ©)*(*, io)(S*Sp • ii)(w) k 2 dk da> 
= f (p • ?>)*(&, o>)(/o • m)(&, o)) dco 

= J | / o (&)| 2 & 2 dk J v *( cj ) u(<o) da>. 


This means that | S(k)u j y = 8^ at almost all k e [0, oo) for any 

given i, j ~ 1, 2,..., if u ± , u 2 ,•••> is an orthonormal basis in L 2 (Q S , ju, s ). 
Since the set {(f,j): f, j = 1, 2,..., } is countable, this implies that 
S'*(&)aS'(&) = 1 for almost all k e [0, oo). A similar argument establishes 
that S(k) S*(k) = 1 also almost everywhere in [0, oo). 

The unitarity of the operators S(k), 0 ^ k < +oo, enables us to 
derive Theorem 7.4. 


Theorem 7*4* Suppose the potential V(r) satisfies all the conditions 
imposed on it in Theorem 7.2. Then for almost all k e [0, + oo), there is 
an orthonormal basis {w^k), w 2 (k),...} in L\Q 8 , pu s ) such that 

(7.61) S(k) = ]T | w v (k)} e 2i8 v (k \w v (k)\ = s-lim ]T | w v (k)y e 2i6 v (h \w v {k)\, 
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where the numbers S^), S 2 (k),... are real and such that 

(7.62) Tr[r*(6) T(k)] = £ | sin S„(6)| 2 < +oo. 

V =1 

Proof. The kernel T {1) {k\ co, co') = — (27r)~ 2 kf k& ,(io) of the operator 
T(k) is square integrable in (co, co') on Q a x Q a with respect to the 
measure fju s X fJL a (see Exercise 7.3). Hence, the operator T(k) on 
L 2 ( Q a , ^ s ) is of the Hilbert-Schmidt type* and therefore completely 
continuous. 

Since S(k) is unitary for almost all k e [0, oo), we can apply Theorem 
6.1 of Chapter III and write 

/.2tt 

(7.63) S(ft)=f &dE x (k), 

J 0 

where the above integral converges in the uniform sense of (6.1) in 
Chapter III. We shall prove now that the complete continuity of the 
operator T(k) implies that the spectral function E x (k) is constant in A 
except for an at most countable number of discontinuities A| 0) (^), 
W),..., where E#>(k) * E^_ 0 (k). 

Suppose to the contrary that there is a point A 0 e (0, 2tt) such that 
E^k) = E^k) but EJk) ^ E h {k) for any a < A 0 < b such that 
a, b e (0, 2tt). Then we can choose a sequence A x , A 2 ,... —> A 0 which is 
monotonically increasing or decreasing, and such that E Xn (k) ^ E x (k). 
To be specific, let us say that A x < A 2 < ••• < A 0 , and let us choose 
vectors f x ,/ 2 e ^ which are such that {E K Jk) — E x {k))f n = f n 
and ||/ n || = 1. We shall prove that no subsequence of T{k)f x , T(k)f 2 ,... 
is convergent, thus contradicting the complete continuity of T(k). 

In fact, it is easily seen that T(k)f m X T(k)f n for m ^ n (see Exercise 
7.4), and consequently 

|| T(k)f m - T(k)f n || 2 = || T{k)f m || 2 + || T{k)f n || 2 . 

On the other hand, || T{k)f n || > C > 0 for all n = 1, 2,..., 


II m/n II 2 = X - O/- I ( 5 (*) - !)/•> 


= Xf \e u — \\ 2 d II E x {k)f n II 2 

477 ■ , [A„.A n+a ) 

| sin A | 2 || E x (k)f n || 2 

[A n .An+i) 




T~2 

47T 2 1,2,... 


| sin A n | 2 > 0. 


* This can be easily inferred by setting fi 8 (B) = 0 for B C [R 2 — £2 g , thus extending 
H B to R 2 , and then applying the result of Exercise 5.3 and of Theorem 8.6 in Chapter IV. 
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Thus, no subsequence of T{k)f x , T{k)f 2 ,... is a Cauchy sequence, 
which is impossible if T(k) is of Hilbert-Schmidt type. Hence, E x (k) 
can increase only in a discontinuous manner. 

Since L\Q B , /jl b ) is separable, there can be at most countably many 
such points Aj 0) (&), A£ 0) (&),... of discontinuity. Thus, (7.63) becomes 

S(k) = £ - E Cw _ 0 (k)), 

n=l 

where it is easily seen that the convergence of the series is in the strong 
sense, but not necessarily in the uniform sense (think of the case S(k) = 1, 
when the convergence is not uniform). 

If we introduce an orthonormal basis in each one of the mutually 
orthogonal subspaces on which E x b\ k )(k) — E^)^ 0 (k) 9 n= 1,2,..., 
project, we obtain an orthonormal basis in L 2 (Q B , ^, g ) by taking the 
union of all these bases. After adequately labeling the elements of these 
bases, and replacing A^ 0) (&) with the adequately labeled 8„(&) = ^A£>(&), 
we obtain (7.61). 

The corresponding expression for T(k) is 

(7.64) T(k) = JL {S (k) - 1) = i £ | «,„(*)> ***« sin 8„(k) <«v(*)|. 

Thus, we have 

(7.65) T*(k) T(k) = 1 £ | «,„(*)> | sin 8^*)|» <«> V (A)|, 

and since T(k) is a Hilbert-Schmidt operator, (7.62) follows. Q.E.D. 

We have seen in the preceeding section that when the potential V(r) 
is spherically symmetric, physicists call the numbers S„(&) phase shifts . 
We can apply this terminology (see also Ikebe [1965]) to the general 
case, when V(r) is not necessarily spherically symmetric. Such a name 
for 8^) seems appropriate, since the absence of interaction is character¬ 
ized by vanishing phase shifts S„(&) = 0; in fact, when there is no inter¬ 
action we have 5=1, and therefore 

m = i = £ i «v(*)x«v(*)i. 

V=1 

Furthermore, if w v (k) = Yf 1 for all k e [0, oo) then 8 v (k) equals S^k) 
[see (7.76)—(7.78)]. 

If we expand /(^(to) in the orthonormal basis {w 1 w. w 2 (k),'..} and 
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compare the Fourier coefficients of this expansion with (7.64), we get 
(7.66) /(to')(<») = —4 tj hr 1 £ w„(k; to) e iS * m sin 8 v (k)w*(k; to'). 

V=1 


Hence, the total scattering cross section o(& 2 /2m) at a given energy 
k 2 /2tn can be expressed conveniently in terms of phase shifts: 

(7 ' 67) " (“£■) = I CT (-JnT ’ ") do> = ^ % 1 W ^ k ' Cz) Sin Wl*- 

Naturally, both formulas (7.66) and (7.67) are unambiguous only if 
T(k\ a)', o>) and w v (k\ co') are continuous functions of co', so that (1.47) 
is valid. 


7.6. Partial-Wave Analysis 

for Spherically Symmetric Potentials 

If the potential V(r) is spherically symmetric, i.e., if (7.45) is satisfied, 
then the functions w v (k\ co) can be easily computed. 

Using the well-known Bauer's formula (see Exercise 7.5), 

00 

(7.68) exp (ikz) = exp(*&r cos 0) = £ i\2l + 1 )j\(kr) P t (cos 0), 

1=0 

and combining it with the addition theorem for spherical harmonics,* 

(7.69) j>,(« • i Yr(*)(YrM*, 

1 m =-l 

we obtain upon setting YiU™) = 

(7.70) exp[t£r(to • to')] = 4w £ £ ^(fo - ) ^(w) ^t( w '). 

Z=0 m=—l 

where co = (0, <£) and = (0 X , t^) are spherical coordinates which 
correspond to the unit vectors ugU 3 and a> 1 e R 3 , respectively. Let us 
insert the above series in the integral in (6.58) defining / k (o)). Due to 
the uniform convergence of the series for c*>' e (see Exercise 7.5) 


* See, e.g., Messiah [1962], p. 496. 
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we can integrate term by term, thus arriving at the following relation: 

( 7 - 71 ) AH = Z 47rt"*F; m (oj) I* drr 2 V 0 (r)j\(kr) 

1=0 m=—l J 0 

X f da, 1 Y* H (-a, 1 )0 ( k +) (r<o 1 ). 

J °s 

From the Lippmann-Schwinger equations (6.5) we immediately 
see that when V(r) is spherically symmetric &\+\r) is a function of only 
r, k , and cos 6 = r * k jrk\ 

( 7 -72) &l +> (r) = cos 0). 

Hence, by carrying out a change of variables in which to x is replaced by 
spherical coordinates of r with respect to k, we obtain by (7.18) in 
Chapter II 

(7.73) [ YU-^) 0< + V«b) dco, 

•>n s 

(0 for m=£ 0 

Vrr(2l + 1) f +1 &+\r, u) P t (u) du for m = 0. 

V J 

Consequently, (7.71) can be written in the form 

(7.74) T™(k; to, o,') = a t (k) P ,(<o • to') 

= £ E ««(*) y.»(«)y£K), 

Z=0 m=—l 

where the convergence is in the mean on Q B x Q s , and 

(7.75) a t (k) = \/%irkmi l f dr r 2 V n (r)j l (kr) f du&^\r,u) P t (u). 

J 0 J-l 

Thus, we have arrived at the following expansion for S(k): 

(7.76) S(k) = 1-2t n T(k) = £ Z |1?">(1 - Inia^k))^ |. 

1=0 m=—l 

Comparison with (7.61) yields the following explicit expression for the 
phase shifts: 


e 2id t (k) 


(7.77) 


= 1 — Inia^k). 
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The reader can compare the above results with the considerations 
in Section 7.4, and easily arrive at the conclusion that 

(7.78) S l (k) = k e [0, oo), / = 0, 1, 2,..., 

with Sf(&) given by (7.75) and (7.77) is indeed the S matrix in partial 
wave representation, whose existence has been established in Theorem 

7.3. 

The great importance of the above formula for the S matrix lies in 
the fact that in conjunction with (7.75) and (7.77) it enables us to 
analytically continue Sfk) in the complex planes of the variables l and k . 
The properties of the resulting complex S matrix S(k , Z) provide the 
foundations of a very fruitful theory of the S matrix, which has become 
the fundamental stepping-stone of much of the theoretical physics in 
the 1950’s and 1960’s. Since this book is concerned exclusively with 
functional-analytic methods, this topic is beyond its scope, but the 
reader is urged to consult the References for literature dealing with 
this very important subject.* 


Exercises 

7*1 ♦ Show that if f(t) is continuous and the improper Riemann 
integral 

/»<* /»+°° 

I 0 = lim fit) dt = fit) dt 
exists, then the improper Riemann integral 

/ = C e-‘*f{t)dt 

J 0 

exists for every e > 0, and lim e _^ +0 1 € = I 0 . 

7.2. Show that for any given \fj{ r) e ^(R 3 ) there is a constant C 0 
such that \(e~ iHot ^)(r)| ^ C 0 for all r g IR 3 and all t e R 1 . 


* A mathematically rigorous treatment of analyticity properties of the S matrix in 
potential scattering is given in the work by de Alfaro and Regge [1965], but most other 
textbooks on scattering theory also treat this subject at an adequate level of mathematical 
rigor—whereas the same certainly cannot be claimed when it comes to the functional 
analysis aspects of scattering theory. 
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7.3* Prove that for k e [0, oo) with fcpj'lm £ \ the integral 


| T a) (k; to, to')| 2 da> da>’ 

£2 g X£2g 

exists, provided F(r) satifies the prerequisites in Theorem 6.4 and 

Tll) (k; w, to') = ~(2^kf ke} ,( to). 


7.4. Suppose [£ v (*) - £*/*)]/« =f t ,i= 1,2 for 

(Aj, Aj ] o (A 2 , A,'] = 0 , 

where i? A (A) is the spectral function of S(k), and A x , A 2 , A/, A 2 ' e (0, 2 tt) 
Prove that T{k)f 1 J_ T(k) f 2 . 

7.5. Prove that for any aeR 1 


= I *'*(2/ + 1 )ji(*)P l (u), -1 < « < 1, 

1=0 

and that the convergence is uniform in u e [— 1, +1], 

7.6. Show that for any /3 > 0 and any r 0 e R 3 

- (^f~P 

lf <Ar 0 (r) is a wave packet with a Fourier transform of the form 
<£- ro (k)=exp 

Use this result to show that for any bounded Borel set B 

|| E°(B)r*V4"W 

= ^ |(«-"V& # )(r)|* dr < rf\B) ^ 


7*7* Any Schroedinger-picture free state 

A r ) = ^ 0 f )( r )> 

corresponding to ^jeL a (R 8 ), displays a tendency of “spreading out” 
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in space as t —> — oo. This conclusion can be reached by establishing 
the evanescence of the wave packet 0j( r ) from any bounded Borel set B: 

Combine the result of Exercise 7.6 with Lemma 1.1 and Lemma 7.1 
to prove that the above limit is indeed equal to zero. 

7*8* Prove that any interacting state ifs ( represented by a wave 
packet, 

0,(r) = (^0 o )(r), 0o eL a (R®), 

which has an incoming (outgoing) asymptotic state 0f n (0° ut ) evanesces 
from any bounded Borel set B in the configuration space when t — oo 
(t ~* + °°)* 


8* Fundamental Concepts 

in Multichannel Scattering Theory 

8.1. The Concept of Channel 

The outcome of the mutual interaction of three or more particles can 
lead to results which are qualitively different from the possibilities 
encountered in two-particle interactions. Indeed, a system of two 
isolated particles can either be found forever in a bound state, or the 
two particles are free before and after collision. However, in the case 
of three particles , S 2 > and > there already are other possibilities 
in addition to the two alternatives of , S 2 , S 3 becoming eventually 
free or constituting a bound system For example, it is 

possible that and S 2 will stay bound forever, thus building a new 
system ®!-® 2 , while S 3 becomes free, etc. Each one of these possibilities 
determines a particular outgoing arrangement channel of the scattering 
process between , S 2 , and S 3 . 

In general, an arrangement channel (or clustering) in the scattering 
process of n particles , S 2 >•••> i s a particular partitioning of the 
set {£>! , ® 2 >•••» i nto a number of nontrivial subsets, called clusters . 
When all the particles in each cluster are actually bounded together 
either before or after collision with particles in other clusters, then we 
say that the clusters constitute fragments of the collision process. Clearly, 
we can talk about incoming channels and outgoing channels, depending on 
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whether the particular channel we are considering refers to the system 
before or after the scattering had taken place. 

In the case where some of the particles in the system {©! , S 2 ,..., S n } 
are identical, not all partitions of the set , © 2 S n } will represent 
distinct arrangement channels. This is due to the presence of Bose- 
Einstein or Fermi-Dirac statistics, respectively (see Chapter IV, §4) 
which rules that identical particles are indistinguishable. For example, 
if ©! , S 2 > an d are identical, then the channel > ^ 3 } i n which 

©! and S 2 are bound, while S 3 is free, is identical to the channel 
{©! , ® 2 ~® 3 } m which ©j_ is free and S 2 is bound to S 3 . As a matter of 
fact, in this example both above arrangement channels are also identical 
to {S 2 , ©i-Ss} so that we have only three distinct arrangement channels: 
{Si , S2-S3}, and {©! , S 2 , S3}. 

If each fragment F is in an eigenstate of a set of commuting 
observables which are constants of motion in that particular arrangement 
channel, then the system is said to be in the channel determined by 
eigenvalues of the observables in the family \J&G& . 

In practice, it is often convenient to take an (9$ containing the internal 
(binding) energy of the fragment, its internal angular momentum 
(called the spin of the fragment), etc. If the internal energy of each 
fragment F has been prescribed, then we call a channel determined 
by the given internal energy eigenvalues an energy channel* 

In the case of many-body (three or more particles) scattering we can 
distinguish between elastic and inelastic scattering t by looking at the 
distribution of energy among fragments before and after the scattering 
has taken place. Elastic scattering is, by definition, the scattering process 
in which all the fragments and their internal energies are preserved; 
for example, if the initial energy channel consists of a particle S x being 
scattered from a bound system S 2 -©3 of two particles, then the scattering 
is elastic if after the scattering has taken place the particles S 2 and S 3 
are still in a bound state of the same energy as the initial bound state, 
i.e., if no kinetic energy has been transferred from S x to the internal 


* We depart somewhat from conventional terminology, in which a channel with respect 
to some of the above-specified observables is simply called a channel. However, since 
there does not seem to be complete agreement among different authors on the precise 
contents of the set 6& when a channel is defined, the above general approach seems quite 
desirable. 

+ The distinction can be made also in the case of two-body scattering in an external 
field. In fact, the external field describes the interaction of the two particles with one or 
more additional particles which are not explicitly included in the system, but are effectively 
represented in an approximative manner by the “external” field. In that case, each one 
of the particles in the system can be bound to the field. However, it is clear that two-body 
scattering in an external field is, in fact, many-body scattering. 
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energy of © 2 -© 3 • Naturally, any scattering which is not elastic is called 
inelastic. 

Suppose all the fragments in an energy channel are in their internal 
energy ground states and that AE is the smallest energy gap between 
the internal energies of these fragments and the energetically nearest 
nonground eigenstate of their respective internal energies. Then, 
obviously, no inelastic scattering can take place as long as there is not 
sufficient kinetic energy to be transfered to the & 0 th fragment to raise 
it to an “excited” state of internal energy Ej^ + AE. This amount of 
energy is then called the threshold of the inelastic process in that particular 
energy channel. 

If in an inelastic scattering the fragments themselves are decomposed 
and new fragments result as an outcome of the scattering process, then 
we talk of a rearrangement collision\ for instance, in the above example 
this is the case when a bound state of © x and S 2 is formed, while S 3 
goes free after the interaction has taken place. 

It should be immediately realized that a rearrangement collision cannot 
take place at any distribution of energy between the different fragments. 
For example, in an energy channel of , S 2 -S 3 } a minimal energy 
equal to the internal binding energy of S 2 -(5 3 * s necessary for the decom¬ 
position of the fragment © 2 -© 3 occurring in the initial state. This energy 
is then the threshold energy for a rearrangement collision. 


8.2. Channel Hamiltonians and Wave Operators 

We shall illustrate the mathematical counterparts of some of the 
physical concepts introduced above on the case of three-particle potential 
scattering. For this purpose we consider three distinct particles which 
interact with one another via two-body forces. Here, the statement 
that we are dealing with two-body force means the total potential can be 
written in the form 

(8.1) V(r x , r 2 , r 3 ) = V 12 (r x — r 2 ) + F 23 (r 2 - r 3 ) + V 13 (r x - r 3 ), 

and therefore each one of the particles interacts with each other separately. 
In addition, we simplify our considerations by assuming that these forces 
are of finite ranges p iy -, i.e., V is (r ), i <j = 1, 2, 3, vanishes for r > p^ . 

The total Hamiltonian of the above system is given by the Schroedinger 
operator 

(8.2) H = H 0 + V 12 + V 28 + V 13 , 
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where H 0 is the free Schroedinger operator 

(8.3) (H„<A)~( Pl > P* > P*) = (|£ + ft*. - P* * Ps)> 

and V , i < j , is the potential energy operator between the ith and the 
jth particle, 

(8.4) (F,,0)(ri, r 2 , r 3 ) = ~ 0(r x , r 2 , r 3 ). 

Naturally, these operators act in the Hilbert space L 2 (IR 9 ). 

The arrangement channel , S 2 > £> 3 } m which all three particles 
are free is characterized by the fact that all these particles are so distant 
from one another as to be outside the ranges of the forces with which they 
interact on one another. Hence, for such distances V{r x , r 2 , r 3 ) vanishes 
and the Hamiltonian H ( 123 ) of the system is given effectively by the 
operator H 0 , which is said to be the Hamiltonian of that particular 
arrangement channel. 

In the arrangement channel {©! , ® 2 ~® 3 }> the particles S 2 and S 3 are 
within the range p 23 of their mutual interaction forces, while © x is 
outside the range p 12 from S 2 > and outside the range p 13 from S 3 . 
Consequently, in this case V(r 1 , r 2 , r 3 ) becomes equal to F 23 (r 2 — r 3 ), 
and the Hamiltonian of the arrangement channel is #(i >2 _ 3 ) = H 0 + ^23* 
We see that in the present situation we are dealing with a much more 
intricate framework then in two-body scattering: instead of a single 
“free” Hamiltonian, we have a different Hamiltonian for each one of the 
five arrangement channels. 

We can write the Hamiltonian of each one of the arrangement channels 
as the sum of the Hamiltonians of each one of the fragments in that 
channel. Thus, for example, 

(8- 5 ) #(1,2-3) = #<i) + #( 2 - 3 ) > 

where, in general, denotes the kinetic energy operator of the ith 
particle, so that in the momentum representation, 

(8.6) (H«)0)~ (pi, p 2 , p 3 ) = P i 2 l2nii 0(p x , p 2 , p 3 ), i = 1,2, 3, 

and #( 2 _ 3 ) describes the system {S 2 -S 3 }; we can write #( 2 - 3 ) in the form 

(8*7) #( 2 - 3 ) = #( 2 , 3 ) 4" ^23 9 


where f/( 2 , 3 ) is the kinetic energy of the fragment {© 2 -(5 3 }, 
(8*8) #<2,»> = #(2) + #<8) » 
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and V 23 describes the interaction between the two particles in the 
fragment. 

When the system in some arrangement a and there is no interaction or 
negligible interaction between the fragments, then the state W(t) = 
e~ iIItl F(0) of the system is described quite well by the states 

(8.9) tf'f (t) = e~ iHat V° x (0) 

provided they are oc-channel strong asymptotic states in the sense that* 

( 8 . 10 ) lim || ^(0 — ^ x (0 || = 0 , 


where “ex” stands for “in” (t -> — oo) or “out” ( t -> +oo), depending 
on whether the channel is an incoming or outgoing channel, respectively. 

In two-body scattering we could formulate a general consistent time- 
dependent scattering theory for any two self-adjoint operators H and H 0 . 
However, in multichannel scattering one has to be careful and establish 
from the very beginning whether the channel description is indeed consis¬ 
tent, since it is not a priori clear that some interacting state might not have 
two or more distinct asymptotic states satisfying ( 8 . 10 ). In order to be able 
to find the conditions under which such ambiguities in the asymptotic 
description do not occur, we have to understand more fully the structure 
of the arrangement channel Hamiltonian H a . 

Let us consider the arrangement channel , ® 2 -S> 3 } m the scattering 
of the three particles , S 2 > and £>3 interacting via the two-body poten¬ 
tial (8.1). The fragment S 2 -® 3 is asymptotically in a bound state 


(8.11) 3 >(*) = exp(-z7/ (2 _ 3) 0 yV 3 ) (0) 


of S >2 an d • 

Let us assume that ^( 2 _ 3 )( 0 ) is represented by the wave function 


( 8 . 12 ) 


0( r 2 > r 3 ) — ( 


m 2 r 2 + m s r 3 
m 2 + m z 


)nr s -r 2 ). 


where according to §5 of Chapter II, p' describes the state of the center 
of mass of the system S 2 -£ 3 , while ^"(^3 — r 2 ) describes the state of the 
two particles in relation to one another. The two particles are in a bound 


* In fact, the remarks made in §2.6 still apply, i.e., the use of norm topology is too 
restrictive to apply when long-range forces are present—only expectation values of 
observables actually being of importance. 
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state when i/r"(r 3 ~ r 2 ) is an eigenvector 

(8.13) Hit 3) y = E (2 _ 3) y, £ (2 _ 3) < o, 

of the internal energy operator 

(8-14) Il\ts) 3 - 2 ^ A + ’ 

where A is the Laplacian in the variable r — r 3 — r 2 . Naturally, the most 
general ^( 2 -3)(0) has to be an element of the closed subspace of L 2 (R 6 ) 
spanned by all such vectors (8.12), corresponding to all eigenvectors 0" 

of 

When S 2 -S 3 is asymptotically in the state ^( 2 _ 3 )(0 eL 2 (R 6 ) and 
is asymptotically in the state } F( 1 )(t) f then the entire system is in the state 

(8.15) ® ¥V 3) W eLWY 

If ^( 2 -3)(0) is of the form (8.12) and iff" is represented by the eigenvector 
in (8.13), then it is easily seen that 

(8-16) H (l . 2 _ 3) <^ ex (0) = (H (l) + //£_“> + H\t s) ) ^ ex (0) 

= (ff (l) + + E (2 _ 3) ) >F ex (0), 

where 22 ( 2 - 2 ) i s the kinetic energy operator of the fragment ® 2 -S 3 , 

(8.17) (tf£3^)~(pa , Pa) = 2 (mf+m 3 ) ^ ’ p3 >’ P * 3 = Pa + Pa • 

The quantity | 2?( 2 _ 3 ) | is called the internal energy of the fragment 

®2-®3 • 

Naturally, it is often desirable to consider energy channels with respect 
to such internal energy operators. This is possible since in general, 
as well as in the above particular case, these internal energy operators 
commute with the arrangement channel Hamiltonian, and therefore 
they are constants of motion for the clusters in the given arrangement 
channel. 

Let us assume that , S 2 -S 3 } is an incoming energy channel, and 
that the kinetic energy part of iff(r 2 , r 3 ) has a sharp distribution | $'(^ 23 ) 1 2 
of momentum, i.e., that the incoming momentum of ® 2 -S 3 is prepared 
very accurately around some value P 23 . We can choose the system of 
reference in such a manner that P 23 ^ 0, and consequently 

(8.18) , Pa , Ps) « ' + £< 2 - 3 )) # in (Pi. Pa > Ps)- 
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Hence, we see that in this case the incoming particle © x must have an 
amount of kinetic energy at least equal to | E ( 2 _ 3 ) | in order to be able to 
impart it to S 2 --S 3 m the collision process and break up that fragment, 
thus giving rise to a rearrangement collision. Consequently, in this case 
—E( 2 - 3 ) represents the threshold energy for a rearrangement collision 
of the system {©j. , S 2 > £> 3 } m the given incoming energy channel. 

The reader can easily verify that the Hamiltonian H a of any of the 
above arrangement channels of the system {©j. , S 2 , S 3 } is the sum of a 
kinetic energy part of the center of mass of the fragment, and of an 
internal energy part For example, in the arrangement channel 

{Si, s 2 -s 3 }, 

/q 1 Q\ Tjrkln _ 17 1 rrkin ij^t _ rrint 

(O.iy; "(1.2-8) — “( 1 ) H(2-Z) 9 "(1.2-8) — "(2-3) > 

in the arrangement channel {©j. , S 2 , S 3 }, 

(8.20) = H (l) + H (2) + H l3) = H 0 , = 0; 

in the bound-state channel {©j-Sg-Sg}, is the kinetic energy 

operator of the center of mass of the entire system, and 

(8.21) HjfV s) = tfuVs) + V n + V 23 + V 13 

It is obvious that the kinetic energy operators of all channels commute 
with one another since they are all functions of the compatible momentum 
observables of the three particles. We also know from §5 of Chapter II 
that each center-of-mass kinetic energy operator commutes with the 
corresponding internal energy operator. However, in general, the 
Hamiltonians for different arrangement channels do not commute among 
themselves. This is easily seen to be so, for example, in the case of 
{©i, S 2 , S 3 } and {©!~'©2 , ® 3 } when the channel Hamiltonians are 
H 0 and H = H 0 + V 12 , respectively. 

Following the guidelines set up by the above example, we postulate 
that in any multichannel scattering process a unique channel Hamiltonian 
H a is attached to every arrangement channel a. Let us denote by 
the sets of all vectors/which are not eigenvectors of H a and for which the 
respective strong limits 


s-lim e iHt e~ iHoct f 

*-»=F 00 

exist. By applying Lemma 1.2, we immediately conclude that are 
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closed subspaces of which might contain only the zero vector, as 
often happens in practice (see Exercises 8.1 and 8.2). Hence, we can 
introduce the projectors onto and define the Moller wave 

operators for the channel oc by the formula 

(8.22) Qf = s-lim e mt e~ iH ^ E M<a) . 

t-*T oo 1 'j. 

The above formulas are obviously straightforward generalizations 
of (2.1). Hence, as mathematical objects, will have for each channel oc 
the same properties as Q ± . In particular Q ( ± } are partially isometric 
operators with initial domains and final domains R^, which 

coincide with the ranges of these operators. The intertwining properties 
(2-3)—(2.5) 

(8.23) Q ( ± ] e itH “ = e im , /eR\ 

(8.24) E h «(B) = E S (B) Q ( ? } , Be® 1 , 

(8.25) Qf HJ = HQ±*f, f e® Hx , 
and the relations (2.10) and (2.11) 

(8.26) a£*ag = e m m, 

(8.27) 0£q£*=E r u ) , 

will hold in the present case. Theorem 3.1 is also still valid, so that 
= s-lim e f e €t e lKt e~ lHott £ M ( a ) dt , 

e-»+0 J _ 00 ± 

(8.28) 

= s-lim £ f” e- £t e iHt e~ iH ^ E„ {ci dt. 

e-»+0 J 0 ± 

Moreover, the above two formulas can be taken to be the starting point 
of the time-independent approach to multichannel scattering (see 
Section 8.6). 

8.3. The Uniqueness of Channel Strong Asymptotic States 
Take now /e Mo a) = M+* n M! a) and write f ± = Q { ±f Since 


(8.29) Urn || e~ im f±* - «-"■•/1| = Hm ||/i a) - e im e~ iH ^f\\ = 0, 
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we see that e~ iH ^ f is the incoming asymptotic state of the interacting 
state e~ iHl f ( +\ and the outgoing asymptotic state of the interacting state 
e~ iHt f { *\ The questions now arises, however, whether e~ iH ^f is the only 
incoming (outgoing) asymptotic state of e~ iHi namely, there 

certainly cannot be another incoming asymptotic state g of e 
in the same arrangement channel, but, in general, there could be some 
other incoming asymptotic state e~ iH P l g of in some other arrange¬ 

ment channel j8 =£ oc. 

It was pointed out earlier (in the three-body case) that we cannot expect 
that such a uniqueness of asymptotic states would hold for any arrange¬ 
ment channel Hamiltonians H a picked up at random from the family 
of self-adjoint operators in To prove such a uniqueness of asymptotic 
states we have to restrict the families of candidates for arrangement 
channel Hamiltonians by requiring that such operators obey certain 
conditions, which are dictated by the physical situation at hand. 

Extrapolating from the example considered earlier of three-body 
scattering, it seems sensible to require that every channel Hamiltonian 
H a on and on be the sum 

(8.30) H a = H* in + H™ 

of a center-of-mass kinetic energy part H* m and an internal energy 
part here H is the operator which represents the sum of the 

kinetic energies of the centers of mass of all clusters, while the operator 
Hl nt represents the sum of the internal energies of all clusters in the 
arrangement channel oc. Judging from the wave mechanical three-body 
problem, it is also reasonable to postulate that Hl nt has a pure point 
spectrum and commutes with H ^ m , while H^ in has only a continuous 
spectrum, and that all kinetic energy operators H commute among 
themselves. 

When the channel Hamiltonians have the above indicated structure 
we can prove that incoming and outgoing (strong) asymptotic states of 
any interacting state are uniquely determined by that interacting state, 
and consequently the description of the scattering experiment in terms 
of the asymptotic behavior of the system is completely unambiguous. 

Let us first understand better the problem of uniqueness in mathe¬ 
matical terms. We have seen in (8.19) that if t ) = e~ iHt W 0 is an inter¬ 
acting state, and if W 0 = Q^Wq 1 , then W in (t) = e~ iHott W™ is an in¬ 
coming asymptotic state of W(t ). If ^^(t) = exp(— iH^t) Y'J 11 were 
another incoming asymptotic state of ¥*(£), then we would have = 
^(V)ipin Th us> y*i n (£) could be a state different from ’f^Z) if and only 
if oc =£ oc' and W Q = Q^Vq 1 = i.e., if W 0 belonged to both 
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ranges R^* and R+'* of and respectively. Consequently, the 

necessary and sufficient condition which has to be fulfilled in order to 
have a unique asymptotic state corresponding to an interacting state 
is that the different ranges R+* should have no state vectors in common. 
Naturally, a similar conclusion holds for the ranges R { “\ which are also 
required to have no state vectors in common. 

The two ranges R^ } and R^ r) , oc ^ a', will have no state vectors in 
common if, in particular, they are orthogonal to one another; in fact, in 
that case R£ } and R^' } have in common only the zero vector which is 
not a state vector. 

The following theorem shows that if the channel Hamiltonians have 
the earlier mentioned structure, which is reflected in (8.30), then 
R+° _L R+° for all oc oc and R!? ) _[_ R^ r) for all ^ y8'. Consequently, 
under those conditions every interacting state has at most one incoming 
and one outgoing (strong) asymptotic state. 


Theorem 8*1 ♦ Suppose that for a given system the arrangement- 
channel Hamiltonians H a , acting in the separable Hilbert space J#*, 
are of the form (8.30), where all commute among themselves, while 
each i/ a mt has a pure point spectrum when restricted to M^* and it 
commutes with the corresponding H* m . If oc and /3 are two distinct 
arrangement channels and — H* in has no point spectrum, then 
R£> _j_ Rf and R!r ) _j_ R ( «. 


Proof . It follows from the basic definitions of wave operators that 

(8.31) = (s-lim £< e >*(*))(s-lim Q^(t) £„<„>) 

C“* + °0 -j- 

Since i/* nt has a pure point spectrum on M^°, the integral in the spec¬ 
tral decomposition of i/* nt can be written as a sum: 

(8.32) H™f = £ A£ W ({A})/, fe n ; 

A6S(«) 


here the summation extends over all eigenvalues of corresponding 
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to eigenvectors in M^, and 2?“({A}) is the projector onto the eigenspace 
corresponding to the eigenvalue A. It should be observed that in this 
context it is indeed possible to sum over all eigenvalues A on account 
of the fact that there can be at most countably many eigenvalues of a self- 
adjoint operator in a separable Hilbert space. 

We immediately get from (8.32), by using in the process the commu¬ 
tativity of i/k in an( j 


exp {-iHJ) = expexp£ £<“>({A}) E„ w 

= exp(—£ exp(— i\t) £ ( “>({A}) £ M<a) , 

AeS(“) ± 


where the above sum, if infinite, converges in the strong sense. Since 
a similar relation can be derived for Erf® e tH0 \ and since H* iri and Hf in 
commute, we can write for any fe and£ e M^ ) 


(8.33) </|£m( B ) e iH ^ e- iH ^ E^ )g } 


1 Mte) e 

= E. (E, ,</*' 
A ' ss a ) ^ 


exp [i(H*» - H? in + A' - AMft>), 


where/,- = £W({A'})/and ft = E^({X})g. 

Let us take any two vectors / 6 Ml? and g e Ml? which have only a finite 
number of nonzero components / A ' and g A , respectively. Then, in view of 
the fact that < /1 e iH P t e~ iH * t g') is continuous and therefore integrable in t 
on any finite interval, we have 

(8.34) 

f T <f I e iHet e-^g} *=££ f I exp [i(H** + A' - - A )t] g,} dt. 

Since Hf in + A' — — A is a self-adjoint operator, we can apply 

the mean ergodic theorem (see Theorem 8.3 in Appendix 8.9 to this 
section) to each term in the sum of (8.34). According to this theorem 

(8.35) Mm ~ J # T </,-1 exp+ A' - A)*] £,> dt 
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where the above inner product is zero because the projector onto the 
characteristic subspace of H^ in — H^ in corresponding to the eigenvalue 
A — A' has to be zero on account of the assumption that H* ln — 
has no eigenvalues. 

By using (8.31), we obtain (see also Exercise 8.3) 

</1 Q^g> = Hm </1 E„(a *** \w?> 

= ~ j\f | £ M ( S) e iH * e*** E„ ( _ a) g} dt. 

If /e Ml p) and ^ e Mi a) have only a finite number of nonvanishing 
components/ A ' and^ A , respectively, then in view of (8.34) and (8.35), we 
conclude that £2 { * ] g ) = 0 for all such / and g . Since the set of 

all such vectors/e and^ e M^ ) is dense in and M^, respectively, 

it follows that the same is true for all /, g e This establishes the 
assertion that R^ X Ri? ) for a /3. 

A similar argument yields that R^ X R+° for a ^ /?. Q.E.D. 

The above theorem is valid under the restriction that H* m — H* m 
should have no point spectrum when oc ^ /?. It should be noted that this 
condition is certainly satisfied if H* in and H are two distinct functions 
of the momentum operators of the n particles partaking in the scattering 
process. The reader can easily convince himself that such is the case, 
for example, in potential scattering, so that Theorem 8.1 is always 
applicable in that case. 


8.4. Interchannel Scattering Operators 

In multichannel scattering we are faced with the possibility that a 
system prepared in some given arrangement channel does not have to end 
up in the same arrangement channel. Hence, obviously, no single 
scattering operator S could describe all the transitions which the system 
can undergo. Instead, to any pair (/?, oc) consisting of an incoming 
arrangement channel a and an outgoing arrangement channel /? we have 
to attach a different scattering operator, 

(8.36) 

describing the transitions between these two channels. In fact, if W(t) has 
the incoming asymptotic state S^f) = e~ iHott V^O), then the transition 
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amplitude to the outgoing state = e~ iHat ^^(O) of free fragments 

in channel j3 is 

(8.37) lim <¥'° ut (i) I = lim V?\0) \ e-* m W(0)} 

= <£>?’ ^ ut (0) I *P(0)> = <&i @) <P° ut (0) I Q? ^“(O)) 

= <n>) | 5 fi ^'“(0)>. 

From (8.23) we easily obtain for all t e U 1 

(8.38) S Ba e iH «* = «"«' = g! s) * e iH! 

= S ea . 


We can also derive in similar manner 

(8.39) S 0a E h «(B) = E h *(B) S 0(X9 Be 

One of the most important properties of the single-channel scattering 
operator is its unitarity. To establish the presence of some analogous 
property for multichannel scattering we have to consider simultaneously 
the entire array of all interchannel scattering operators. Then, using (8.27) 
we obtain 

(8.40) z s *Ab = Z (fi?*i3!r ) xfli v> *fl? > ) 

V V 

= fl?* (Z &-&-*) Qf = Z E R(y) Q f 

In the case of single channel scattering the identity of R + and R_ is 
a necessary and sufficient condition for the unitarity of S on M 0 (see 
Theorem 2.5). A reasonable generalization of that condition to the 
multichannel case would be to require that 

(8.41) ©rS* =©rL v) = r, 

V V 

where we tacitly assume that the conditions of Theorem 8.1 are satisfied, 
and therefore R^ _L R+* and R^ ) _]_ R^ ) for a ^ j8. The assumption 
(8.41) implies that 

z wf = Z = Of, 

V V + + 
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since is a partial isometry with final domain R { £K Hence, (8.40) 
yields 

(8.42) £ S* x S vB = &:>*Qf = K b E m m , 

V 

where S a/3 = 1 for a = /? and S aj8 = 0 for a ^ /?. 

A similar procedure leads to the conclusion that 

(8.43) £ = 8«A,m 


when (8.41) is satisfied. 

8.5. The Existence of Strong Asymptotic States 
in n-P article Potential Scattering 

Let us consider the wave mechanical case of n particles without spin 
interacting via two-body forces, so that 


(8.44) (/#)( ri ,..., r„) = ,..., r„) + £ F«(r, - r,) flr,,..., r.), 

*, 3=1 

i<j 

where, using the momentum representation, 


(tf O 0)~(Plv..,Pn) = Z g^( Pl ,.., P n) 


Let us choose an arrangement channel a containing the fragments 
The free motion of each one of these fragments is 
described by the Hamiltonian 


(8.45) 


h: 


(Jc) 


y + 

i<=gr(k) 


*<3 


^(Q*- 


■Q#)» 


where the summation is over the indices of the particles in that fragment. 
The channel Hamiltonian is 

(8.46) H a = H® + -+ Hi n «\ 

Take Rl lc) to be the center-of-mass position vector of the kth fragment, 
and let p, k) stand for some choice of internal motion coordinates, so that 
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(R^ } , p {k) ) completely describes the positions of all the particles in the 
fragment. Then proceeding in the manner indicated in §5 of Chapter II, 
we can write 

(8.47) Hi k) = -(l/2M< fc) ) + 77< fc)lnt , M (k) = £ m t , 

ie&rW) 

where M {k) is the mass of the fragment ^ {k) and H (k)vnt is the Hamiltonian 
of the internal motion of the particles in that fragment. If *Ai &) (pi &) )> ^ — 
1 ,...,n a , represent any eigenvectors of then e~ iH ^ ¥J(0), with 

¥^(0) represented by 

not 

(8.48) ..... r„) = n ^a%a\ ^ eL 2 (R 3 ), 

1 


is a state in which all fragments in the channel oc are moving independent¬ 
ly. Obviously, the closed linear manifold N (a) spanned by all the functions 
of the form (8.48) coincides with the set of all free state vectors in the 
channel a. 


Theorem 8*2* (Hack’s Theorem). Suppose the functions F^-(r), 
i h j = l>-> w, are Lebesgue square integrable on IR 3 . Then the 
initial domains Mo a) = M^ } of the Moller channel wave operators 
coincide with the closed linear subspace N (a) of L 2 (IR 3n ) spanned by all 
functions */r a in (8.48) obtained by letting \jj (k) vary over all eigenvectors 
of H {k)int and ¥**> over all of L 2 (R 3 ) for k = 1,..., rc a . 

In order to avoid very involved notation, we shall prove Theorem 8.2 
for the special case of the arrangement channel , ® 2 “® 3 } m three- 
body scattering. In this case 


(8 . 4 9) a _ J_ i„ _ A. - ^ A.„ + V M , 


where jw 23 is the reduced mass of the second fragment ® 2 -<s 3 an( f 


(8.50) 


r 23 = r 2 — *s 


R = 


m 2 r 2 + m 3 r 3 
m 2 + m s 


The space N ( “ > of free state vectors in this channel is the closed linear 
manifold spanned by all the functions 


. r 2 . r 8 ) = ^(n) V™(R) W(r 23 ) 


( 2 )/ 


/,< 2 )/ 


(8.51) 
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corresponding to all ¥ / a (1) , ¥ / c [ 2) eL 2 (U 3 ) and all eigenvectors i/r* 2) of 

ff (2-3)lnt3 __I_ Ar+ ^ 

On account of Wiener’s theorem (see Lemma 7.1) the linear set spanned 
by all functions (8.51) with W ik) , k = 1, 2, represented by functions 
which are inverse Fourier transforms of Gaussian wave packets 
exp [—(p — Pfc) 2 ], & = 1, 2, is dense in N (a) when p x and p 2 are allowed 
to vary over R 3 . For such functions we can easily compute 

(8.52) |(exp(-*^*)^ ) )(r)f 

a 

= | 4 [ l + (230pr) ] j ^ ex P[-«*( r - b *) 2 ]- 

where = m 1 , M {2) = m 2 + m 3 and 

t i / / t ^ 

b * = lfpr P*’ a * = 2 l 1 + \ 2 M$ J • 

Hence, according to Theorem 2.8, the strong limits 

s-lim e im H = H X + V 12 + V w 


exist for all We N (a) if the improper Riemann integral 

r+ co 

(8.53) f \\(V 12 +V 13 )e- iH ^^\\dt 

J —oo 

is convergent for all 0 of the above indicated form. Now, using (8.52) 
we get the following kind of estimate: 


^12 «■"“**« IP 


- J, w- - *> [ 4 (' + (sr))]' - "Ji 


[ 4 ( 1 + (- 


2x-,-3/2 


-))] 


2(m 2 + m 3 ) 

X exp [-^ ( m f tt * t - bi) ] I — r a )| a dr, dr, dr. 


l ^ I 16 (• + hsr) X 1 + (isrW))] 


2\ *1 —3/2 


(equation continues) 
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X exp[—a 2 (R - b 2 ) 2 ] ^ 2) (r 23 )| 2 ■ 5' 1 ’ *u 

c7 V r 12 y r 23 > 

N'+feyr- 

Hence, we conclude that (8.53) converges, and consequently N (a) = 

M!p) = = M{j°. 

With the existence of asymptotic states established, the next question 
of general interest is asymptotic completeness. If denotes the closed 
subspace of bounded states of a general n-particle system described 
in the Hilbert space ^ then in accordance with Definition 6.2, the 
theory is said to be asymptotically complete if (8.41) is satisfied and 

(8.54) = R © . 

In multichannel potential scattering theory there are at the present 
time proofs of asymptotic completeness in the three-body case for 
large classes of short-range interactions (Faddeev [1965], Thomas 
[1975]), as well as for special instances of n-body (n ^ 4) scattering 
(Hagedorn [1978], Sigal [1978a, b]), but the general situation is as yet 
far from being as satisfactory as in the two-body case. 


**8.6. Two-Hilbert Space Formulation of Multichannel 
Scattering Theory 

We have seen in §8.4 that in multichannel scattering theory we cannot 
express all relevant physical information in terms of a single scattering 
operator acting in the Hilbert space of the system, but that we have 
to deal rather with operators S 0ot of the form (8.36) for each pair (/?, a) 
consisting of an incoming and an outgoing arrangement channel a and 
j8, respectively. We can, however, represent all these interchannel 
scattering operators in terms of a single operator S J from a new Hilbert 
space of incoming asymptotic states into another new Hilbert 

space J(? 0XLt of outgoing asymptotic states if we build the direct sums 
(Ekstein [1956]) 

(8.55) ^r ln = © m^, ^r out = © m!_ s) 

a 0 

over all incoming and outgoing arrangement channels a and j8, respectiv¬ 
ely. We shall refer to and as the m- and out-asymptotic 

Hilbert space , respectively. 
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Let us therefore denote by J ± the identification (or injection ) operators 
which, by definition, assign to the vectors 

(8.56) /=©/.£ ^ ln , g = ®g e e ^ out . 

a 0 

the respective vectors £ a / a and from Naturally, the J ± operators 
are not in general invertible, but they are bounded transformations 
(in the sense of Definition 1.5 in Chapter III) from ^ ex , ex = in, out, 
into Therefore they have adjoints / ± + , which, by definition, satisfy* 

(8.57) <h\J±f> = </ ± t *|/>«, 

for all /e ^ ex and h e where <• | *) ex , ex — in, out, represent the 
inner products in the asymptotic Hilbert space ^ ex constructed in 
accordance with Theorem 6.1 in Chapter II. It is a routine matter to 
extend Theorem 2.7 of Chapter III to bounded transformations from 
one Hilbert space into another and thus conclude that J ± f are bounded 
transformations from into a^ ex . 

It is also easy to establish that the strong limits 

(8.58) Q± J = s-lim e iHt J ± e~ iH ™\ 

(8.59) H ln = © H out = © m, 

a 0 


exist as bounded operators from ^ ex into and that in fact 


( 8 . 60 ) Q + J = 0 

a 


Therefore their adjoints 


( 8 . 61 ) (a + J y = '£&?•, 

a 


Qj = 0 Q^\ 

0 

(Q_ J Y = ££)<?*, 


also exist as bounded operators, but from the Hilbert space of the 
system into the asymptotic Hilbert spaces Jf 7 * 11 and respectively, 

so that 


( 8 . 62 ) s j = (ajya+ J 

is a bounded operator from into 3t? out . 

* In general, if A is a bounded linear transformation from into M* 2 (see Chapter III, 
Definition 1.5) and <• | •>*, i — 1, 2, denote the respective inner products, then by 
definition (A f g | f} x = <g | Af } 2 , /e , g e Jt 2 . The existence and uniqueness of 
A 1 ' is proven exactly as in the special case where b 3tf 2 (cf. Chapter III, Theorem 2.7). 
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Let us denote by E+ ] and E { f ] the projectors onto M+* and M^ ) , 
respectively, when these two spaces are viewed as subspaces of 
and respectively. Then by (8.60)-(8.62), 

(8.63) E ( I ) S J E ( f = E^f/^'JJ J + £i ( fE ( f, 

so that the above operator is essentially* equivalent to S 0OC . In fact, 
we have for / and g given by (8.56), 

(8.64) | S J E^fy mi = <& | S B J a ), 

so that S J incorporates information about all interchannel scattering 
operators Sap • 

In Theorem 8.2 we have seen that in potential scattering M+* = 
M^ ) = M { 0 a) for all arrangement channels, so that we have a single 
asymptotic Hilbert space ^ ex and a single identification operator J: 

(8.65) ^ in = jT 0Ut = ^ ex , J + = J_ = J. 

Moreover, (8.41) is satisfied as part of asymptotic completeness. As 
easily seen, this implies that S J is unitary on the Hilbert space 
appearing in (8.65). 

Proceeding under the assumption that (8.65) is true, we can make 
the transition to stationary two-Hilbert space scattering theory by 
following exactly the same pattern as in §3 in the one-Hilbert space 
context. Thus we introduce the Bochner integrals 

(8.66) = T* e ±€t dt, 

Jo 


and then routinely establish that the following results are true (see 
Prugovecki [1973a, Theorems 2.1—2.4}): 

(8.67) W ± = = s-lim W e JT ex , 


=b i* 

A -H±ie 


jdjtr 


d,E“J 


T it 

A — H ex =F ie 


(8.69) E?(E)Q± J = Q± J E H ' >X (B), B e J 11 , 


* There is an obvious conceptual distinction between as a subspace of and 
the space M^ ,# of all vectors ® a '/ a ' in which have only the component / a different 
from zero. To avoid too cumbersome a notation we shall, however, ignore this distinction. 
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(8.70) J U F ± = W + w-lim (A - U ex ± U)-\J'H - II ex D d,E»V± 

>_oo 

(8.71) = JW + s-lim f +C ° (A - H ± U)~\HJ - JII ex ) d A Ef^. 

6~>+0 J _ QQ 

Qrjr i j 

(8.72) r _ - flJj-O.' 

- ;C^ 

1 l»-f00 f gy 

= ; (g - ». + «. - w - /*“> ^ ■ 

Furthermore, the spectral-integral expressions (8.72) for the T 7 operator 
can be related to a operator analogous to the ^(£) operators 

introduced in §4.8 for the two-body case. For example, the first set 
of formulas (8.72) yields (see Chandler and Gibson [1973]): 

(8.73) T J = w-lim w-lim j ^ d A E A j ^ 8 f ( A - m) + «?) d u E u , 

sr%) = /VU - JH ex ) + (pn - i7 ex / + )(£ - h)-\hj - ;// ex ). 

We note that if ^ ex = ^ and J = 1 (so that i/ ex = // 0 ), then 
(8.70)-(8.73) reduce to (3.34), (3.41), (3.52), and (4.114), respectively. 
Clearly, (8.70) and (8.71) can be justifiably called two-Hilbert space 
Lippmann-Schwinger equations . These equations remain valid even in 
the presence of long-range forces under the proviso that asymptotically 
compensating operators Z a [analogous to Z in (4.121)] are incorporated* 
into /. 


8.7. Multichannel Eigenfunction Expansions and T-Matrices 

In its fundamental aspects the approach to the formulation and treat¬ 
ment of eigenfunction expansions is the same in the n-body case as it 
has been in the two-body case, but the level of complexity increases 
drastically with increasing values of n . Furthermore, the Fredholm 

* For details see Prugovefiki [1973a], A variant of this approach has been extensively 
studied by Gibson and Chandler [1974] and Chandler and Gibson [1974, 1977, 1978]. 
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method of §6, used extensively in the n = 2 case, becomes inapplicable 
in its original form when n ^ 3. 

To formulate the concept of eigenfunction expansions in the multi¬ 
channel case we have to choose in accordance with Definition 4.2 
for each arrangement channel a complete sets of observables {X| a) , 
X2 0 ,...} and (yj a) , that are best suited for that particular channel. 

Among the X (a) -observables, the standard choices always include the 
center-of-mass observables Q ik l for each of the k = 1n a clusters 
[whose position variables in the corresponding spectral representation 
spaces for complete sets of observables, which incorporate , we 
have denoted in (8.48) by R* fc) ]. The corresponding center-of-mass 
momentum observables P ( c ^ [whose variables we shall denote by Pj/ C> ] 
are always included among the y (a) -observables, with the internal 
energies for each fragment being the next standard choice. Thus, in 
accordance with (8.48) the corresponding eigenfunction expansion 
consists of functions (in physical literature usually called stationary 
states ) of the form 

(8.74) ,PJ= fl 'fHWS 

1 


K = (Ri 1) ,-, Ri"“ > ), Pa = (Pi 1 *,-, Pi” a) ), 

where p {k) and co {k) are variables associated with the rest of the complete 
set of observables for the kth cluster. For example, in case of the arrange¬ 
ment channel (S 1 , S 2 — S 3 | [with which the equations (8.5)—(8.8) and 
(8. 11)— (8. 17) are associated], and ca 1 are redundant for the first 
cluster which consists only of © x , whereas for the second cluster 
1*23 = r 2 — r 3 play the role of p 2 and E ( 2 - 3 ) is incorporated among the 
variables in c o 2 [to be very specific, if P 2 — P 3 is an electron-proton 
pair, and the spin observables are ignored, then in accordance with the 
results of §7.7 in Chapter II, we could choose o> 2 = (Z, m , ri)\ 

The presence in (8.74) of the plane waves 

(8.75) <^«(Rj rt ) = (2tt)" 3/2 exrftP^R"), k = 1 ,..., « a , 

tells us that, as in the two-body case, 0^ are not elements of Mj a) . 
However, in general, if we make in (8.73) the transition to the con¬ 
figuration representation by expressing (for spinless particles) i? a and 
p a in terms of r x ,..., r n [cf. (5.10) in Chapter II for a specific example] 
so that & { P" 9(0ol becomes a function of r x ,..., r n e IR 3 , then as a 
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function of P a and co a , 


*> = 



r n ) 0(r x r n ) dr x - dr n y 


is in general a well-defined element of 


L 2 



dP f 


m 



for any ifs e M (a) C L 2 (R 3n ) [where N a denotes the total number of 
Y (a) -observables and v^ is the measure over the spectral representation 
space for the complete set of observables related to the internal degrees 
of freedom in the &th cluster]. Using the symbolic notation in (4.81), 
we can express the fact that, for all channels a , constitutes an 

eigenfunction expansion in by the shorthand statement 

(8.76) X J | dv a ( P a , I = l ex , 


n $ 

^=1 

where l ex denotes the identity operator in ^ ex . 

Likewise, the Green’s operator G ex (£) for the asymptotic Hamiltonian 
H e x [given by (8.59), but with H in = i/ out ] can be expressed as 

, \Hijdv a (p^co a )( 0 i P t ola \ 

(8.77) = (£ - H«)-i = £ f-, 

“ C - zTJPa^M^) + A®] 

where Xl k) is the internal energy of the kth. cluster in the a-channel, 
and it is in general a function of Hence, the Green “function” for 
the channel Hamiltonian H a is in general a complicated entity, 

& ( £ ola (K , P«) dv a (P a , coj <>*(*«', Pa) 

(8.78) G a (R a , Pa ; RJ, Pa '; Q = f---. 

C - l 2M ^ + 

which might turn out to be well-defined only in the sense of distribu¬ 
tions rather than as an ordinary function. Naturally, the explicit use 
of the theory of distributions can be avoided once it is realized that in 
general the integrations in P {k) and w ik \ k = 1,..., n a , have to be 
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executed after the integration in , p a , i? a ', and p a ' have been per¬ 
formed upon taking inner products with suitable wave packets. 

These complications make the task of reproducing the derivation of 
§§4.6 and 4.7 in the multichannel context generally unfeasible without 
the introduction of new techniques. True, on a purely formalistic 
level, it would appear that (8.70) unambiguously yields 


(8.79) 

(8.80) 


+ G„(A a ± iO) V&VJSL 
V a = H-H ai 


noc p(fc ) 2 

\ _ V _j_ 

a Lu 9 MW ^ a * 




(8.81) 

r noc 

= l.i.m. f G a (R x , Pa ; * a ', Pa '; £) Pa ') fl VcfVcf')- 

J 7-1 


However, the fact that F a is not square-integrable even in the 
presence of two-body forces with integrable potentials invalidates the 
technique used in the two-body case. A further warning that something 
is seriously amiss with purely formal extrapolations from the two-body 
case is provided by the observation that due to the presence of different 
arrangement channels, (8.79) does not have in general unique solutions* 
even in the case where Theorem 8.2 assures us that exist on non¬ 
trivial initial domains M^, and are unique. 

All the previous observations are applicable to multichannel T 
matrices, to whose derivation we turn next. 


* This can be easily established on a heuristic level by using the second resolvent 
equation to write for jS ^ a, 

mu±*) = _j-* c (A<“> — H ± ie)~ l 

= d=*c(A (a) - H ± , + (A (a) - H ± ie)-W $£><± c > . 

X a -L ~ * P’&><*>’0 X « * a P'B^’B 

When c —► “f" 0 we obtain 


<*>(0><±) = G (a) (A <a) ± iO)V 

P’p,<*>'B « P’ 0 ,aj’ 0 

since is not an eigenfunction of H a , and therefore the first term on the right-hand 

side of the previous equation vanishes. Thus, is another solution of 

(8.79) for any aeC 1 and any p =£ <x (see Lippmann [1956] and Epstein [1957] for further 
details). 
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According to (8.72), we have for fe Mg 0 and g e Mjf’, 

(8.82) Q I ry> = lim <£ I 7£>( e )/> = lim <£ I T^)f\ 

(8.83) 27rtT«>(e) = (f^J - fl«>)*fl<.“> t 2»*T«>(e) = - **?). 

By applying the same procedure as in §4.7, we formally obtain for 

i = 1, 2, 

(8.84) <£ | 7£W> = J dv B (P B ', <) <V) J MPa . -,) 

X W - I V B | <.<+>)/(*>« . *>«), 

(8.85) <£ | T^)f) = J dv s (P B ', «„') g*(P B ', tog') J <fo a (P a , «,„) 

X W - A a )(0^ ( - ) - fl | F a | 0.0/(P. , wj. 


where in accordance with the notation in (4.81) and (4.89) we have 
written 


(8.86) /(P a ,a,J = (4“ ) , w J/> 

= l.i.m. f ,..., r n )/(rj,..., r„) dr x ••• dr n , 

*'(R 3n 


(8.87) (4># Vfl | F@ | 0^) 


l.i.m. J ,..., r„) F 0 (rj,..., r„) r„) dr x - dr n 


with remaining symbols being defined along similar lines. We note that 
whereas, e.g., Wa was defined in (8.74) as a function of the center- 
of-mass position variables R* fc) and additional internal motion variables 
p {k) for each cluster k in the arrangement channel a, ^ is a function 
of the position variables r x ,..., r n of each of the individual particles 
S x ,..., 6 n in the system—the transition from one set of variables to 
the other resulting from equations such as those in §5.2 of Chapter II, 
relating two such sets of variables. Naturally, if a = j8 then one can 
use in (8.87) the cluster variables R {k) and p {k \ making sure, however, 
to also express V a in term of these variables. 

It should be recalled now that, whereas in our treatment of the two- 
body case we had already eliminated in the initial stages of our con- 
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siderations the center-of-mass variables, so that we have been working 
the Hilbert space Jf (1) (see §1.7) of the reduced particle, in the present 
n-body case, n ^ 3, that reduction has not been performed since, in 
general, it does not prove to be as advantageous as in the n = 2 case. 
However, if there are no external forces [and therefore V as well as 
each V a do not depend on the center-of-mass variables R of the entire 
system; cf. Exercises 5.1 and 5.3 in Chapter II], we can always easily 
dispose of three integrations in (8.87) by introducing R as one set of 
integration variables. Indeed, we only have to note that ^j> a) >£0 in (8.74) 
contains the product of exponential functions (8.75), from which the 
factor 

/ n <* , . 

(8.88) (2tt)- 3/2 exp tR £ P<*> 

\ fc=l 

can be extracted, and the same holds true for Hence, formally 

speaking, the R-“integration” gives rise to a S 3 -“function,” so that 
we can write 

(8.89) (0^, w - s | V B | *££>) 

( n/3 n a \ 

Epf - E Pi k) W.1I p .. 

j=i fc=i / 

(8.90) (0^H | Va | <><J 

( n/3 n a \ 

E P B W ' - E P" W. I T j (2) I P a , <0, 

1 fc=l / 

where the two bona fide functions of P a , co a , jP 0 ', appearing on 
the right-hand sides of (8.89) and (8.90) are called, respectively the 
“post” and “prior” versions of the on-shell T matrix for the multichannel 
collision process in question. Naturally, the appearance of the S 3 - 
“functions ,, merely signifies that in deriving (8.84) and (8.85), the 
orders of all the P a and P a integrations could not be actually reversed, 
so that, for example, the l.i.m. in (8.87) had to be interpreted in the 
context of the last integral in (8.84): in that integral the integration in 
R a and P a in fact have to be restricted to compact sets in order to justify 
the reversal of integrations in R a and P a , and upon letting those sets 
tend to [R 3n «, Fourier theory (e.g., Theorem 4.5 in Chapter III) has to be 
invoked in relation to the R-variables [the procedure being reminiscent 
to that leading from (5.9) to (5.10)]. Thus, the use of S 3 -“functions ,, 
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could be avoided, but in physical literature their explicit appearance is 
favored due to their providing a graphic illustration of the conservation 
of the total momentum for the entire system. 

In this same spirit of notational expediency, it is customary to combine 
the first operator relation in (8.72), i.e., 

(8.91) S J = l ex —27«T jr , 

with (8.82), (8.84), (8.85), (8.89), and (8.90) into a so-called 5-matrix 
formula that generalizes (4.107): 

(8.92) <P*', cV | 5 J | P a , aO 


= n « 

&=i 




P^R 


— 2m B(Xg(Pg, OJg) — XJP tt , a> a )) 

( 71/3 7l a \ 

X P< ?r - £ <Pg\ OJg' \T J \P a , a) a >. 

j=l 1 / 

Clearly, the first term on the right-hand side represents a symbolic 
variant of the formula (8.76) for l ex , and either one of the two versions 
of the on-shell T matrix in (8.89) and (8.90), respectively, can be used. 


**8.8. Multichannel Born Approximations and Faddeev 
Equations 

A “prior” version of the T matrix could have been introduced in the 
two-body potential scattering as well, but nothing new would have 
been gained due to the relationships 

(8.93) 0 k *(r) = 0_ k (r), <R>*(r) = ^(r), 

and to the presence of a single arrangement channel and therefore of a 
single interaction channel Hamiltonian, namely V. That this is not so 
in the multichannel case becomes clear as soon as we consider extra¬ 
polating the Born approximation method to the n-body case with n ^ 3. 

Indeed, by repeatedly iterating the Lippmann-Schwinger equation 
(8.79) we obtain a Born series for retarded stationary states, 


t + X (G*(A a + iOWSKt 


*(«) 


(a) 


n-1 


(8.94) 
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and by the same token, also a Born series for advanced stationary states, 


(8.95) 


0 


pf &><*>'& 


= 0 


< 0 > 

P' 




Inserting these results in (8.89) and (8.90), respectively, we obtain 
“post” and “prior” Born series for the corresponding “post” and 
“prior” versions of the on-shell T matrix. We immediately see that in 
rearrangement collisions already the first post and prior Born approxi¬ 
mations are going to be markedly different even on the energy shell, 
namely 


(8.96) | V B | <P£ ia J * I V. I J- « * & 


k(oc) 


,<e> 


p^(a) 


so that computationally the two versions of the Born series can lead 
to significantly different approximation schemes. 

Despite its wide usage in atomic and molecular physics, there is 
almost no knowledge of the convergence properties of the general 
Born series (8.94) and (8.95). The main difficulty stems from the fact 
that the Fredholm method of §§6.1-6.9 is no longer applicable already 
in the three-body case. 

This can be easily verified, for example, for the channel oc = {®i, 
S 2 > £> 3 } where, assuming two-body forces, we have by ( 8 . 2 ) and (8.80), 

(8.97) V a = V {lt2 , 3} = H-H 0 =V 12 + V 23 + V 13 . 

It is convenient to work in the momentum representation so that 
assuming that F^-(r) gL( 2 )(U 3 ) we obtain 

(8.98) (V^)~( Pij , P„ , p k ) = (2*y* f V tj (p H - ph) fai!, P« , p*) dp’ i} 

4 3 


by using center-of-mass coordinates for { ( Z i , and noting that 

(8.99) ~h ~h ^ijPij > 

( 8 . 100 ) r tj = r* — r j , = (w^r* + ^r.)/( w . + m .) t 

(8.101) = Kp, — mjpJKmt + m 0 ), = p t + p,. 


Hence, G a (£)V a = (£ — 1.2.3} equals the sum in i <j = 



8* Fundamental Concepts in Multichannel Scattering Theory 


623 


1, 2, 3 of the following three operators: 

(8.102) (G 0 (t)F w *)~( p«,P„,p*) 

ViAvu - p u) 


(2 tt )- 3 / 2 J 


y _/ P a i P# i Pfc 2 \ 

\ 2 ^ 2 Mu ^ 2m k ) 


" *A(P«3 > P«3 > Pfc) ^P# • 


The question is whether upon making the transition in (8.102) to the 
variables p x , p 2 , p 3 [through the use of (8.101)] and then eliminating 
the center of mass motion of the three particles by factoring the Hilbert 
space J^ {3) = L 2 (U 9 ) of the entire system into a tensor product @ 
e^ (2) [representing an extension of (1.26) to the n = 3 case], we are 
going to end up with Hilbert-Schmidt operators. Now, working in the 
center-of-mass frame amounts to holding 

(8.103) P = Pi + P 2 + P 3 = P« + Pfc 

fixed, and by Theorem 8.12 in Chapter IV and Exercise 5.3, the resulting 
operator in J^ {2) will be Hilbert-Schmidt if and only if it is an integral 
operator with square-integrable kernel. We see, however, from (8.102) 
that this will not be the case except if F^(p) = 0 almost everywhere. 

If G(£) and G 0 (£) denote the Green’s operators of H and H 0 , respectiv¬ 
ely, in the present three-body case, then by Theorem 3.2, 

(8.104) G(Q = G 0 (Q + G 0 (0(V 12 + V 13 + V 23 ) G(0, 

and our earlier conclusion implies that we cannot apply at present the 
Fredholm method as we had been able to do in the two-body case due to 
Theorem 5.7. To circumvent this difficulty, Faddeev [1961a] rewrote 

(8.104) in the form 

(8.105) G(0 = Go(0 + G(1> ® + G(2> (£) + G(3> ©’ 

G u \i) = GS)[V jk + V jk G„(0 <rUQ] G 0 (Q, k, 

where &~ J aa (Q is the three-body counterpart of (4.101) corresponding 
to incoming and outgoing arrangement channel a = {<, S 2 , S 3 }. 
Indeed, in view of (3.61) and (4.102), 

(8.106) t G (i \0 = GMV + VG 0 (0 srim G 0 (Q = Go(0 VG(Q, 

i -1 

and we recover (8.104) from (8.105)—the whole procedure is reversible. 
We note that S'iJJQ can be also viewed as a component of the 
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in (8.73), 

(8.107) STiXi) = E^ZTXq £<“>, M±> = M<,“\ 

for the incoming and outgoing arrangement channel oc = {Si , s 2 , s 3 }. 

Following Faddeev [1961a, b, 1963], we can proceed by using the 
two-body problem to solve the three body one. Hence we introduce in 
e^ (3) the ^-operators for the arrangement channel {©*, — £>*.}, 

(8.108) F jk (t) - V jk + V jk G jk (QV jk , G*(£) = (£ - H 0 - V jk )~\ 

These 3 "-operators correspond to the physical process in which and 
collide, whereas © < is ever present but never interacts with either 
©,. or . 

If we now write (8.106) in the form 

(8.109) G«>(£) = G 0 (£)^«>(£) G 0 (£), ^«>(£) = ^ + V jk G 0 (O^U), 

then in view of (4.103), 

(8.110) ^£,(0 = f 12 + v 12 + f 23 + (V 12 + f 13 + F 23 ) g 0 (O^L(0 

= * 12(0 + ^“(0 + 

According to one of the second resolvent equations we have 

(8.111) G 0 (Q = G jk (0 - G jk (Q V Jk G 0 (Q, 

and substituting this result into the defining expression of 
we get by (8.108), 

(8.112) *£>(£) = V ]k + V }k a k (0[^ a (0 - V jk G 0 (0 3T[M 

= + v ik G jh aw^o - ^«(o + v jk \ 

= + V ik G }k (0[^JQ - ^(0]. 

Finally, taking into consideration that by (8.108) and (4.102), 

(8.113) V jk G jk (Q = ^(Q G 0 (0, 

we arrive at the Faddeev equations for operators : 

(8.114) .r «>(£) = + ^(0 G 0 («)[^-«>(0 + ^>(Q], 


i j ¥= k = 1 , 2 , 3 . 
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The operators are of no direct physical significance, but if 

we can compute them, then by ( 8 . 110 ) we have computed 
and in turn by (8.105) and (8.106) we have also computed the full 
Green’s operator. In fact, by (8.113), 

(8.115) G*(£) - G 0 (0 = G 0 (Q V jk G jlc (Q = G 0 (£) G 0 (£), 

so that the combination of (8.109) and (8.114) yields Faddeev equations 
directly for the components G {i) (Q of the full Green’s operator (8.105): 

(8.116) G«>© = G*(£) - G 0 (0 + G 0 © ^(Q[G«>(t) + <*«©]. 

In turn, proceeding heuristically, by (8.105) we can write a counterpart 
of the solution-type Lippmann-Schwinger equations for the advanced 
and retarded eigenfunction expansions in the a = , S 2 > £> 3 } channel, 

(8.117) < >(±> = ±Hm uG(\ a ± ie) 0<?> 

= <> + t - 

«=1 

(8.118) &\f£ = ±lim ieG U) (K ± u) 0% , P a = ( Pl , p 2 , p 8 ), 

and thus derive from (8.116) Faddeev equations for these advanced 
and retarded stationary states: 

(8.119) 0$<t> = 0 ( ( “>& - 0p“ > + G„(A a ± fO) ^*(A a ± iO)[0#,«? + 0^] 

(8.120) 0(“i)p* = ±lim «G ft (A a ± ie) , A a = £ (Pi 2 /2^). 

1 

In view of the developments in §6.2, it is obvious that are in fact 

the distorted plane waves for the two-body scattering between Sy 
and S k , in which, however, the (free) center-of-mass motion has not 
been removed, and which in addition incorporates the presence of the 
(free) particle ( Z i : 

(8.121) 0 < ( “>&(r 1 , r 2 , r 3 ) = 0,^) 0 Pjl (R rt ) 

Faddeev [1965] has shown that for a large class of potentials 
Vy( r i ~ r i)> operators Gp(£)^J fc (£) in (8.114) and (8.116) are 
completely continuous at positive energies Re £ ^ 0, and that in fact 
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the fifth powers of these operators are compact at all energies, so that 
the solutions of these equations can proceed by already well-established 
mathematical methods. 

All these statements stay true for any pair of incoming and outgoing 
arrangement channels oc and /?, respectively. In fact, in general, 

3VS) = E^rXi) E ( f =V a + V B a - K)-^ , 

and the derivation of the Faddeev equations for all pairs of incoming and 
outgoing arrangement channels can proceed simultaneously through 
the use of the two-Hilbert space formalism of §8.6. 

Yakubowskii [1967] has extended the Faddeev technique in its 
basic aspects to the n-body problem for n ^ 4, so that in gross outlines, 
mathematically exact and yet computationally implementable methods 
are available for the treatment of nonrelativistic scattering of n (point) 
particles interacting via two-body short-range potentials. 


*8.9. Appendix: von Neumann’s Mean Ergodic Theorem 

The following theorem was first derived by von Neumann [1932] 
in order to prove the quasi ergodic hypothesis of classical Hamiltonian 
mechanics, and it was used by us to obtain (8.35). 

Theorem 83* Suppose A is a self-adjoint operator in the Hilbert 
space and 

(8.122) U t = e iAt = f e™ dE x . 

J u 

Then for any 

(8.123) lim — l — t </1 U t g) dt = </| Em) g> , 

ht 2 — J ti 

irrespective of the mode in which t 2 — t x tends to infinity. 

We note that there is a unique bounded linear operator B{t x , t 2 ) 
(a Bochner integral) for which the relation 

</1 B( tl , t 2 )g) = —f t2 </1 U t g> dt 

l 2 — Ti J tj 


(8.124) 
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holds for all/, g (see Exercise 8.7). Hence, (8.123) states that 

£({0}) = w-lim B{t ,, t 2 ). 

12 — 

In order to prove Theorem 8.3, we write 

(8.125) <f\B( tl ,t 2 )g> 

= </l m , 4 ) E({0})gy + </1 , t 2 ) E(W- - { 0 »£>. 

From (8.122) we get 

</| U t E({0})g} = f e»*d(f\E,E({0})g) = </| F({ 0 })^>, 

J K 1 

and consequently, we have 

(8.126) </1 Bit ,, t 2 ) Ei{0})gy = </1 £({0})£>. 

Thus, we see from (8.125) and (8.126) that (8.123) is true if and only if 
( 8 - 127 ) Jim +oo </1 Bit ,, t 2 )h) = 0 , h = EiU 1 - { 0 })£. 

We shall prove that (8.127) holds by showing that 

II Bit,, t 2 )h || 2 = ~ f <2 dt' t dt"<U t .h | U e h> 

— h) J t x J t x 

converges to zero when t 2 — t x -> + oo. 

Since U t /* = U_ t > , we get 

II Bit, , t 2 )h || 2 = -- 1 t dt' t dt\h | U t ~_ t 'h). 

\ — h) J t x J t x 

After introducing in the above integral the new variables t = t" — t' 
and s = t r + t", we arrive at the following relations: 

x 2 h) J -(ti-ti) ^2^+UI 2 

J «+(i 2 —^l) 

= ( T- 1 i 2 J , |)<a I u t K > dt 

- ■"<'* - '■ - 1 '»J„, **»*• 


II Bit,,t 2 )k\\* 
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Since the integral 

f I h — h - 1 1 || d || E x h || 2 dt, S = K 1 x [-(< 2 - h), +(f 2 - <0] 

obviously exists, we can apply Fubini's theorem to interchange the order 
of integration in t and A. Thus, we obtain, after carrying out the integration 
in t. 


!l , t 2 )h || 2 


_ 2 _ 

(h - hf 


(• 1 — cos(/ 2 — t x ) A 

Jr 1 A 2 


d || EJi || 2 


r / sin — < t )A 
Jr 1 ' tj)A 


) 2 d || Efji || 2 . 


Let us split the domain IR 1 of integration in three parts (— oo, —77], 
(—■77, +77], and (77, +00), and majorize the integrand by 1 on (—77, +77] 
and by (^(t 2 — ti)^j )~ 2 on the other two intervals. Then we arrive at the 
estimate 


(8.128) || B& , h)h || 2 < f d || E h h || 2 + \ d\\ E x h || 2 

J <-«,») (h ~ hr V J r 1 


= li(E v - E_ n )h || 2 


(*. ~ h ) 2 V 2 1 


By choosing r) small enough, we can make the first term on the right-hand 
side of the above inequality smaller than e/2 for any a priori given e > 0; 
this is due to the fact that 


lim ||(£ n - E_ n )h I! = || £({0})A || = 0. 

r )-*+0 

Then, for such a value of 77, the second term in ( 8 . 60 ) can be made smaller 
than e /2 by choosing | t 2 — t x | sufficiently large. Hence 

lim || J3(f x , t 2 )h || - 0. 

t 2 — *i-»+°° 

Thus, ( 8 . 127 ) holds, and therefore ( 8 . 123 ) is true. 


Exercises 

8*1 ♦ (a) How many arrangement channels can there be in the 

scattering process of three distinct particles ? 

(b) How many arrangement channels have actually been observed in 
the scattering of a proton^, a neutron tz, and an electron e , i.e., for which 
channels a do we have M { 0 a) = { 0 } ? 
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(c) Give the conventional names of the fragments in all experiment¬ 
ally realizable arrangement channels in the scattering of n> p , and e 
interacting by means of nuclear and Coulomb forces. 

8*2* (a) Count the arrangement channels in four-particle scattering 

in which there are only two pairs of distinct particles. 

(b) How many of these channels are not empty in the scattering of 
two protrons and two neutrons (as far as our present experimental 
knowledge extends) ? 

83* Suppose that f{t) is a continuous function for t e [0, +oo), 
and that lim^ +00 /(^) = a exists. Show that 

Y f 0 m dt = 

8*4* Consider multichannel scattering with a finite number of 
arrangement channels a. Show that the operator 

S' = £ 

a 

commutes with e iHt and with E H (B). 

Remark. This operator was introduced by Jauch [1958b] as a 
candidate for a scattering operator in multichannel scattering. However, 
its inadequacy for this role is reflected in the fact that its knowledge is 
not sufficient to compute transition probabilities for scattering processes 
in which we have transitions between distinct arrangement channels 
(see also Exercise 8.6). 

8*5* Show that if R^ _L R+* and R { * ] _L R ( l ] for a ^ /3, and if in 
addition (8.41) holds, then 

S'*S' = S'S'* = E r . 

8*6* Show that if !P_(J) and have the respective strong asymp¬ 
totic states W^(t) = e-^ and y/out^ = e - iHp t then the 

transition amplitude (^(O) | ^-(0)) = <(yo ut (0) | S 0OC W!^(0 )) can be 
written in the form 

<^ + (0) | ^_(0)> = <^ ut (0) | Q^ H S f Q^W^( 0)> 

= <^ ut (0) | ^i 0) *5"^i a Vi n (O)>, 


provided that the conditions stipulated in the preceding exercise are 
satisfied, and that = Ml^ ) . 
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8*7* Prove that for any family U t of unitary operators defined by 
(8.122) there is a unique bounded operator B{t x , £ 2 ) which satisfies 
(8.124) for all/, 


References for Further Study 


Quantum scattering theory is the subject of a multitude of textbooks and monographs, 
and its basic aspects are routinely treated in just about every single textbook on quantum 
mechanics. That being so, we shall mention only a few monographs especially well 
suited as supplementary reading for this chapter. 

Newton [1966] treats all the principal aspects of the subject (concentrating, however, 
exclusively on physical ideas), and also provides an extensive bibliography covering most 
of the relevant material published by the mid-1960s. Taylor [1972] gives a very lucid 
exposition that incorporates as well as enhances the physicist’s traditional approach to the 
subject, and he also supplies a mathematically careful presentation of complex-analysis 
methods in scattering theory. De Alfaro and Regge [1965] treat this last subject in detail, 
and are doing full justice to its mathematical aspects. Amrein, Jauch, and Sinha [1977] 
give a mathematically rigorous presentation of most of the basic aspects of quantum 
scattering theory, whereas Reed and Simon [1979] concentrate on an array of topics of 
special mathematical interest. 

Mathematically closely related to the subject of scattering theory is perturbation 
theory for linear operators (see Kato [1966], Reed and Simon [1978]), as well as the 
theory of the structure of the spectra of Schroedinger operators (see Jorgens and Weidmann 
[1973]). 

Titchmarsh [1962] provides a detailed treatment of the theory of eigenfunction ex¬ 
pansions for second-order differential operators. A general treatment of the subject is 
supplied by Berezanskii [1968, 1978], and his second book covers many topics of direct 
interest in quantum mechanics. 

Some of the early papers laying the mathematical foundations of quantum scattering 
theory are those by Kodaira [1949], Povzner [1953, 1955], Jauch [1958a, b], Jauch and 
Zinnes [1959], Hack [1959], Zhislin [1960], Ikebe [1960, 1965], Green and Lanford 
[1960], Faddeev [1961a, b], Jordan [1962a, b], Grossmann and Wu [1962], Scadron 
et al. [1964], Hunziker [1964, 1965], and van Winter [1964, 1965]. Since the mid-1960s, 
the number of mathematical papers on scattering theory has been escalating at an ever- 
increasing rate. Hence the reader desiring to familiarize himself with more recent research 
in this area is well advised to consult published conference proceedings on the subject 
(such as those edited by La Vita and Marchand [1974], Nuttal [1978], and de Santo, 
Saenz, and Zachary [1980]), in addition to the references quoted in the appropriate 
context in §§3.8, 4.9, 4.10, 5.7, 8.6, and 8.8. 
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CHAPTER I 

1.3. In order to prove that is infinite dimensional note that 

all the polynomials x n , n = 1, 2 ,..., belong to ^“(R 1 ) and are linearly 
independent. 

1*5* 'V's is the vector subspace of V* which is a subspace of any 
other vector subspace containing S. 

2+2+ In order to prove that is closed under the operation 

of vector addition in ^(R 1 ), apply the triangle inequality (Definition 2.2, 
point 4) on ^°([ a , +#]), and then let a —> -j-oo in order to obtain 

a +°° V 1/2 

-oo 

a +°° \ 1/2 / x 1/2 

+ (J I g{x)\ 2 < +oo 

in case that f(x), g(x) e < g* 2) (R 1 ). 

2.3. Apply the Schwarz-Cauchy inequality on ^°([— a, +a]), and 
then let a —»- oo in order to prove that the integral 

/•+<» 

/*(*) gix) dx 

J —oo 

is convergent. 

3+1+ In checking the transitivity condition make use of the triangle 
inequality. 
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3*2* Employ the triangle inequality to obtain 

<2(£i > I 2 ) < d (£i , Vi) + d (Vi , Vi) + d (Vi » &)> 

and then reverse the roles of , £ 2 and rj 1 , r) 2 . 

33* Apply the triangle inequality first on £, £, 17 and then on £, 77 , £. 

3*5* | 17 ) — d(£ n , 17)1 < d(£, £ n ) according to Exercise 3.3. 

4*7* If S is countable then S X S is also countable. If we have a 
countable number of countable sets S ± , S 2 ,..., their union 5^ U Sg U ••• 
is also a countable set. 

4*8* Show this for l 2 (n) by specializing the methods employed in 
proving Theorem 4.3. 

4*9* In proving that [ S ] = ( S ), show that the closure (*S) of a 
linear space (£) is also a linear space. 

4.10. Derive |</1 *> —</„|* n >| < ||* II11/-/» II + ll/JI Ik - g n \\- 

5*2* In order to prove that the above integral converges for any 
f,ge use the same hint as in solving Exercise 2.2. 

5 *5* Note that the distance d(f,g) of any two distinct vectors 
/, g eT is ||/ — g || = \/2- Let S — {h l9 h 2 ,...} be a sequence dense in 
We can map T in S' by assigning to each/e T a vector h(f) e S such 
that d(f, h) = II/ — A|| < 1/a/ 2. This mapping between T and a subset 
of S is one to one, since || h(f) — > 0 for/ ^ S' 


CHAPTER II 

1*1* Consider R and S to be the subsets of SC = R U S. If T' 
denotes the complement of T C SC with respect to S£, then 

(R u S) - (R A S) = (R u S) n (R A S)'. 

By applying Lemma 1.1 we get 

(R u S) n (R A S)' = (Rn(R A S)') u(Sn(«A S)'). 

Further applications of Lemmas 1.1 and 1.2 yield 


R n (R A S)' = R n S = S n (S A R)'. 
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13* Prove first that if R, S e then R — S and R A S are in 
Then use the result of Exercise 1.1 to show that R n S e&. 

I A* Use precisely the same method as in proving Theorem 1.2. 

1*5* Prove that 

k k 

h u •" ^ h = (A - U 4.) <-> [h - U 7 ™) u ••• u h ■ 

X m =2 ' X m =3 1 

1 .9. Note that f|L A = (ULi *V)'. 

2A* Consider the case B k = [&, +oo), when lim^*, B k = 0 and 
— +oo, k = 0 , 1 ,... . 

3 J* Note that 

= gif)) = {£:/(*) > *(*)} - {*:/(*) > gif)}. 

33* Note that 

{4 f _ 1 ®SP./*(0 > c } = 0 {£/»(*) > c), 

n =1 

{4 Jnf / n (0 < c } = 0 {£:/.(£) < C}, 

n =1 

and use the results of Exercise 3.2 and Lemma 3.2. 

3 A* We can write 


fi x ) = 2 X^(x) + 4xz 1 (x) 

where SR^^h) is the set of all rational (irrational) numbers inside [0, 1], 
Compute the integral from (3.7) by noting that M 1) (SR 1 ) = 0 because 9^ 
is a countable union of all sets {r}, r e 9^ , for which ^({r}) = 0, 
while = [0, 1] — 9^ . 

Note that the Riemann integral $lf(x) dx is not defined. 

33* Note that if 

Wl w 2 

*i(f) = Z a <XR^\ *«(£) = I b iXs tf), 

<-l j-1 
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we can write ^(f) and s 2 (f) in the form 


n\ n 2 


s l(£) = Z E a iX* 

i =1 j=l 


rt\ n 2 

s 2 ,(£) = S X ^Xl^ns/f)* 

i=l i=l . 

3*8* Note that for a real function f(x) = f + (x) — f~M while 
\f(x)\ = /+(*) +/-(*). 

3.9. Note that/(f) - g(£) > 0 for f e #. 

4.1. In proving transitivity, note that for any functions f(x ), £(#), 
h(x), 

{x:f(x) ^ %)} C {{*:/(*) ^ £(*)} u {*:£(*) ^ %)}}. 


4.2. To show that ||/|| > 0 for / ^ 0, use Lemma 4.1. 

4.4. Prove first that C r Q is countable by noting that 91 X *** X 91 
(2 n times) is countable since 91 is countable. Then show that & s is 
countable by using the fact that 91 X 91 X C r Q is countable, and that 
S x U S 2 U • • • is denumerable whenever S ± , S 2 ,... are denumerable. 

4.5. Note that /x( R A -S) = Jo I X*M — Xs(*^)l d^(x). 

4.6. Show that the functions 


X{v}( x ) = 


for x ^ y 
for x = y, 


corresponding to all real-number y , constitute an uncountable ortho¬ 
normal system in L 2 ( IR 1 , /x). Note that for this measure /x, the Hilbert 
space L 2 (I, /x) is nonseparable even when we choose / to be a finite 
interval. 


4.7. Prove first the statement that CJ n is a Boolean algebra of 
subsets of U n by using the same technique as in proving Theorem 1.3. 

4.8. It is obvious that C «^ n , and consequently ^ 

Conversely, = {I n Q: I e */ n } C and therefore & Q n = 

I n fact, / n £? e f° r any /Ge/ n since 

I r\ Q can be written as a countable union of intervals contained in Q. 
For example, for integer k > 0 and finite I divide each one of the edges 
of I into k equal parts and take the union S k of all thus obtained 
subintervals of I which are also contained in Q\ then, in view of the 
fact that Q is open, we obviously have \J k= iS k = I n 
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5 * 1 * If we denote by the force with which the ith. particle acts 
on the kth one, note that 

n 

X F ik = WkVk » F^- = 0. 

i =1 

Employ afterwards the action-reaction principle (Newton’s third law), 
according to which F^ = — F^ , in order to derive that F = 0. 

5*2* Note that since 8 t | iff | 2 = ( 0 ^*) ^ + $*8$ is square inte¬ 
grate and continuous, we deal with Riemann integration and by 
theorems of the calculus we have (justify) 

4 - f , I I 2 ••• dr n = f dt I I 2 dr x ••• dr n . 

(It J [j$3n J [jj3n 

Choose a finite domain D of U Zn of the form X ••• X where 
D x D n are spheres centered at the origin , and denote with S k the 
boundary of D k C U z . By using (5.5) and Green’s theorem, derive 

m**,•••*„ = «£ ^rf [(v^w-^vMds,. 

D k=l k 

The above integrals along the boundaries S k vanish, when each S k 
expands to infinity in all directions, provided \^V k i/j \ = 0 ( 1 ^""“*), 

oc k > 2 . 


53* We have 8V/8X = kV)(Br k ldX); prove that 8r k /8X = 

Since similar results are valid for 8V/8Y and 8V/8Z , one can write 
VrF = ZLi V*F = —E Li Ffc = —(see the notation in 
Exercise 5.1). Since Fife = -F w by the action-reaction principle, one 
gets V R V = 0. 

6 . 2 * Show that the mapping (A,-»/») (/* l »•••>/*„)> A e <A ,•••> 

f n e , provides an isomorphism between <A © ••• © (?„ and 

&k x © ' ‘' © • 

6.3. Verify that the mapping (A , (A ,A)) -* (A ,A >A)> A e <A. 
A e «? 2 , A e , provides an isomorphism between A. © ('A © and 
0 • 

6 . 6 . ImO(/+?) = 0 implies that Im(/1 g) = — Im (g |/) and 
Im Q(f + ig) = 0 implies that Re (/1 g) = Re(g |/). 

6.9. Note that for any A A 6 > 8> &i e ^2 > 


II / ® g - A ® ft II < IK/ - A) <B> g II + IIA <8> ig - ft)ll- 
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Hints and Solutions to Exercises 


6*10* If g g and g x ,g 2 ,... e converges in norm to g , then 
g x , g 2 ,... converges in norm to some limit g e Jf 2 ; g is independent of 
the chosen sequence converging to g. The mapping g —► g of into 

is linear and preserves the inner product. For every h e 2 , there is a 
sequence fi ± , fi 2 ,... e ^ 2 converging to fi , and consequently h is the 
image of the limit h of , h 2 ,... e ^ , where h k is that element of / f r 1 
which is mapped into fi k . 

7*3* Use the ratio test which states that a series a k converges 
if limj .^00 | a k+1 la k | < 1 . 

7A* In order to show that A is not a Hermitian operator, take 
f(r, 0 , (f>) everywhere twice continuously differentiable, e.g., f(r) = e ~ r2 , 
and take g(r) = (1 /r) e _r2 . 

7*6* Prove by induction that for m < n 


<«-" / 2 P m I O = J" e~°P m L n (p) dp 


m \ r co gn—m 

sf /„ *■*> ^ - °- 

thus showing that <4 | / n > = 0 for k <n. 

Use the above result in the first step of the following derivation: 


II i n II 2 = J“ Up)) 2 d P = (-^) 2 J" (-i)« p* ^( P * e-) dp 

= ^S 0 pne ~° dp = h 


which shows that the Laguerre functions are normalized inL 2 ([0, +oo)]. 
7*8* The substitution x = e~ p yields 


f f*(p)g(p) d P = f 
J o J o 


/*(—In*) g(— \nx) 




\/x 


dx. 


This shows that the mapping 


f(p) 


m = 


/(-In x) 
\/x 


defines a unitary transformation ofL 2 ([0, + oo)) ontoL 2 ([0, 1]). Since by 
Lemma 7.4 the family of all polynomials p(x) = a 0 + a x x + ••• + a n x n 
is dense in L 2 ([0, 1]), the family of all functions 


e~ p/2 p(e~ p ) = e~ p,2 (a 0 + a x e~ p + *“ + a n e~ np ) 
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is dense inL 2 ([0, + 00 )). Each e~ p f 2 p(e~ p ) can be approximated arbitrarily 
well in the mean by a function of the form 

VoW + bJi (p) + **• + bj^p), 

since it follows from the last result of Exercise 7.7 (by taking an appro¬ 
priate s) that each e ~ p ! 2 e~ kp can be approximated arbitrarily well in the 
mean by a linear combination of Laguerre functions. 

7*9* The orthonormality can be deduced from the formula 
(see Chapter I, Exercise 5.4) 

h n (u) = 

by the methods employed in solving Exercise 7.6. To prove completeness, 
write for any feL^M 1 ) 

/(«) =/+(«) +/-(«). /±(«) = 2 [/(«) ±f(—«)]• 

Since we have, after the substitution u = \/p, 

r +oo -OO 

f±*(u) g±(u) du = 2 f±*(u) g±(u) du 

J -OO Jo 

= r {p- vi /±wp))*{p- vi g±wp)) dp, 

j 0 

the mapping 


f±{u) -> p-v%(V~p), /(«) e L\ Ri), 

is a unitary transformation of the subspaces L^U 1 ) of even and odd 
functions, respectively, onto L 2 ([0, + 00 )). The completeness of h Q (u), 
inL^R 1 ) can be deduced from the completeness of / 0 (p), /rfp),... 
in L 2 ([0, + 00 )). 


CHAPTER III 

1*2* Prove first that (Kf)(x) is continuous on I for any given f(x), 
f e L 2 (I), and therefore that Kf is square integrable in /. Show next that if 
fi( x ) == / 2 C*) almost everywhere, then (Kf^x) = (Kf 2 )(x) almost 
everywhere. Then check the linearity of the above mapping. 

13* Derive, by applying the Schwarz-Cauchy inequality in L 2 (7), 
that 

J / K(x, y)f(y) dy |‘ < [J^ | K(x, y)\* dy] | f(y) |» dy\. 
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Hints and Solutions to Exercises 


By using Fubini’s theorem (Chapter II, Theorem 3.13) derive from the 
above inequality 

<Kf\Kf> = \dx 1 1 K(x,y)f{y) dy |* 

< [J^/j K ( x >y)\ 2 d y] ll/ll 2 > ll/ll 2 = \ \f(y)? d y- 

1*5* Apply B to a sequence of nonnegative functions f n (x) e 
n = 1, 2 ,..., with support inside [—1/n, +1 /w], for which 

ll/« II s = f |/„WI 2 ^ = i, 

J —00 

and show that || Bf n || —> oo when n -> oo. 

2.5. Note that for fixed /, <f> f defined by <f> f {g) = (/ | ^) is a contin¬ 
uous linear functional. Use Riesz’ theorem (Theorem 2.3) to establish 
that for each fe there is a vector A(f) such that (/1 g) — <l>f(g) = 
< A(f) | gy , and check that the mapping/-> A(f) is a linear operator on 
Jf? . Use the boundedness and Hermiticity of (/1 g) to prove the 
boundedness (by taking g = Af) and Hermiticity of A. 

3.3. If E projects on M, dim M = m, choose {e x ,..., e m } and 
{e m+ 1,..., e n ) to be orthonormal systems spanning M and M- 1 , respectively. 

4*1 ♦ If dimJf’ = n f introduce an orthonormal basis {e ± ,..., e n ) in 
Jf. Prove that {Ve 1 ,..., Ve n } is also an ortho normal basis, and, therefore, 
that 0t v = 

\.2. Use the procedure employed in proving Theorem 5.6 in 
Chapter II. The difference between that case and the present case is that 
Theorem 5.6 applies to I = IR 1 , while at present I = [#, 6], and the norm 
to be taken is ||/|| = \f(x)\ dx. 

4.3. Write the equivalent of (4.8) for the general case, and apply 
Fubini’s theorem (Chapter II, Theorem 3.13) in order to be able to 
proceed in the same fashion as for n = 1. For instance, 


r , r , f +a J \ sin A*1 sin Xx n 

I dx-^ I dx% ,... I dx m j (m ~(— xj 

«/ a «/ _ a J _ a X 1 X t 

f°° sin Xx 1 r+“ r+“ , , sin, 

= dx i - - ••• dx% dx m f(u + x) —- 

•'a ‘X'l J — a J — a Xt 


sin Xx n 


approaches zero when A —> oo, due to Lemma 4.1. 
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4.4. If A e 2 A is a Cauchy sequence, then, due to the 
isometry of A> the sequence Af x , Af 2 ,... is a Cauchy sequence also. 
Hence, if gi , g% »••• G ®a converges to g e Sf B , then , Ag 2 ,... has a 
strong limit. The isometry of B can be deduced from the isometry of A 
and the continuity of B. 

5*2* According to Exercise 5.1, the eigenvalue problem for A is 
equivalent to solving 


X (A^ ^>ik) x k — 0 

k =1 

in A and x lf ... f x n . The eigenvalues A are the roots of the characteristic 
polynomial of || A ik ||, i.e., they satisfy 


det II A^ ~ ^ik II — 0, 


thus being the roots a polynomial of the nth order, which are at most n 
in number. Explain why the vectors / = x x e x + * • * + x n e n , corre¬ 
sponding to all the solutions x x ,..., x n of the matrix eigenvalue equation, 
span^. 

53* Note that || A || is a norm in (see Exercise 1.6), and, 

therefore (Chapter I, Theorem 3.1), every sequence convergent in the 
norm is a Cauchy sequence. 

5*5 ♦ Note that 

MW/ll “IkII | <\\A(t)f-g\\ 

for any g e . 

5,6. Write AB - A{t) B(t) = (A - A(t))B + A(t)(B - B(t)) and 
note that 

II (AB - A(t) B(t))f\\ < ||(A - A(t)) Bf\\ + || A(t)(B - B(t))f || 
while || A(t)(B - B(t))f || < c 11(5 - 5(0)/1|. 

5*7* See Chapter I, Exercise 1.4. 

5,8, Use the continuity from above and below of the measures 
ILj(B) = </1 E(B)f>, and the relation 

||</| (5(5)-£(£„))/> 
valid whenever B n C B or B n D B. 


= mB) - E{B n ))f\\\ 
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Hints and Solutions to Exercises 


5*9* First derive that || A* || ^ || A || from 

II ^*/ll 2 = if 1 AA*f> < ll/ll || AA*f || < M || ll/ll || i4*/||, 
and then reverse the roles of A and A* in the above derivation. 

5AO* Recall that the operator bound || A || has all the properties of 
a norm. In proving the last of the assumed relations use the result of 
Exercise 5.9. 

5AI* Note that \\e m+1 — e m || = ^2, so that Cauchy’s criterion is 
not satisfied. On the other hand, lim</ | e n > — 0 for any f e Jf 7 , since 
Zn=il</ I OI 2 < ll/ll 2 b y Lemma 4.1 of Chapter I. 

6*1 ♦ Note that (/ | g) = if | Ag) is a positive-definite bilinear form; 
apply Theorem 6.4 of Chapter II. 

6*2* The problem can be reduced to showing that if C ^ 0 and 
C ^ 0, where C = C*, £$ c = , then C = 0. This follows from the 

fact that </ | C/> > 0 and </ | Cf} < 0 imply </ | C/> = 0, fe , 
and from the generalized Schwarz-Cauchy inequality of Exercise 6.1. 

6*3* Note that for any figeJf? 

</1 Ag> = if | s-lim A n g) = lim </1 A n g) 

n-*<x> n-*co 

= lim (A n f | g) = <Af \ g). 

n~* oo 

6*5* It is sufficient to consider the case of strictly positive polynomials 
p(e iq> ) > 0 , 9 e [ 0 , 27t], since the more general case can be reduced to the 
above case by adding to the polynomial an e > 0 , proving the existence 
of a polynomial qfpc) for which | q e (x )| 2 = p(x) + e, and letting e—> 0 . 
In case that p{e iq> ) = c k eikq> > 0, the polynomial P(z) = z n p{z) = 

c k _ n z k has no zeros on the unit circle | z | = 1. Moreover, we have 

P(z) = [(s*) 2 ”P(l/**)]* 

which shows that to each root a of P(z) in the interior of the unit 
circle corresponds a root 1/a* in the exterior of the unit circle. Thus, 
P(z) has the roots a x ,..., oc n , 1/oq*,..., 1 /a n * and 

p(z) = c 1 n (* - «,x* - ik*)- 

k=1 


p(e iv ) = c n («** - - “ft*). 


Therefore 
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where necessarily c > 0. Hence, one can take 

n 

q(e^) = Vc (e^ - a k ). 

k= 1 

6 *6* For each A e7, denote by &(A) the greatest integer for which 
m Ma)( a ) - A) < 1 /»; 

such an integer exists since lim u n ( A) = lim n _>oo ^ n (A). Denote by /(A) 
the neighborhood of A consisting of all points x for which 

«*<*>(*) — v n (x) < 1 /»; 

/(A) is nondegenerate since u k(X) {x) and v(x) are continuous. The family 
of all /(A) constitutes a covering of 7. By Borel’s covering theorem, any 
covering [in our case {/(A)}] of a finite interval 7 has a finite subset 
[denote it by 7^),..., 7(A n ) in the present case] which also covers 7. The 
smallest of the number ^A^,..., &(A n ) is the k n we are seeking. 

6*1* Start with /(f) and g(£) positive, and prove the statement by 
going to the original definition of integration and approximating these 
functions by simple functions. Proceed then to more complex cases by 
retracing the definitions of integration in Chapter II, §3. 

6*8* Note that Up(U) = p(U)U for any polynomial p(U) in U, and 
then follow the construction of E x in Lemmas 6.1-6.3, using the fact 
that A s-lim B k = (s-lim B k )A. 

6*l(h Take g = Af in (6.25), consider Exercise 6.7, and note that 
<Af\Ejy = f ud(Ej\Ejy = f pd<f\ Ejy. 

* , R 1 J (-oo,A] 

CHAPTER IV 

1*1* Note that for any 

</1 AEl 2 ) g> = f A d<J | E^(E^g)y 
J or 

= f A d(E^f I E^g) = (E^f I A lg ). 

J u 1 

1*2* The commutativity of the spectral functions can be derived 
from the commutativity of the Cayley transforms 


Vh — (A k — i)(A k +*) _1 
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Hints and Solutions to Exercises 


by noting first that ^> 2 (^ 2 )] = 0 for an y polynomials [( 6 . 10 ) of 

Chapter III], and then retracing the construction of the spectral function 
E[ k) (Chapter III, Lemmas 6.1, 6.3) to obtain [E[\\ = 0. 

1*3* It is straightforward to verify that the statement is true for 
simple functions f(x). Follow, step by step, the generalization of the 
concept of integral contained in Definitions 3.4 and 3.5 of Chapter II, 
to generalize the result to arbitrary ^ p -integrable functions f(x). 

1*4* Taking in Exercise 1.3 p(x) = | ifj(x)| 2 one derives 
( VI <A (*)| 2 d"x = f x k * d^x) = f a 2 d(4> I 

J R n J R n *'DJ 1 

so that <3 Qk coincides with the domain of the self-adjoint operator with 
the spectral measure E (k) {B). Furthermore, applying the decomposition 
(5.13) of Chapter III to the complex measure < [ifj 1 \ E {k) (B) i/* 2 >, one 
reduces it to a linear combination of measures of the form | E (k) {B)\fiy 
to which the result of Exercise 1.3 can be applied to obtain 

f x^*(x) Ux) d«x=\ A d<^ I E^y. 

J u n V 

1*5* Verify that the statement is true for Borel sets B in U n of the 
form B = B x X * * • X B n , and therefore also in the Boolean algebra 
$ n generated by such sets. According to Theorem 5.5 of Chapter III, 
this implies that the two measures coincide on which is the 
Boolean a algebra generated by tffa. 

1*6* Verify that ZJ-'E^U = E(B) is such that < f\A lg } = 
J A d(f | E x gy for any fedtf and g e , and that (cf. p. 225) 

U-'S A = 9 Al = j*: J^A 2 d ||£ A £|| 2 < +00J . 


2*1* Retrace Definitions 3.4 and 3.5 of integrals, as given in §3 of 
Chapter II. 

2*2* Note that S ^ 1 X ••• X S A * is a closed set in R n , and consequently 
it is Borel measurable, so that the integration in (2.3) can be limited to 
S^l X •“ X S A *. 

2A* Take A unbounded, so that Ol A ^ . ForF\(A) = —F 2 ( A) = A 

we have 2 Fi(a) = 2 FtU) = 2 A , whereas 2 (Fi+Fi)(A) = 3^. For F 2 (X) = 
l/Fi(A) = A we have 2 F ^ A ) F ^ A ) C 2 Fi ( a ) ^ , whereas 2( F ^. F j( A ) — Jf?. 



Chapter IV 


643 


2.5. Use the result of Theorem 2.4 to show that </1 F 1 F 2 g') = 
if I ■f’AS') for all f g 6 JP • 

2.6. Show first that A is linear. Of A is dense since the set of vectors/ 
for which ( E — E x ^)f = f for some real \ < A 2 is contained ; in Of A 
(because then the above integral converges for all g e Jtif) and is dense 
in Jf*. Prove that A* D A and then show that Of A + C Sf A by duplicating 
the method used in proving Theorem 2.6. 

2.*l. Use the result of Exercise 1.3 to establish that Sf A = @ nQ } 
and that | F(Q X ,..., Q^) 0 2 > = \ A*ft 2 y for all e and all 

4*2 G ® A * 

3*L Write/(#, t) = s-lim At _> 0 (l/At)[ifs(x, t + At) — f)]. Accord¬ 

ing to the mean-value theorem of differential calculus for 0 ^ 0 X , 
0 2 < 1 

= f | Re 1 + 6 1 At) + iim t + d 2 At) — f(x , t) | 2 d 3n x . 

If we consider, when letting At —► 0, a sequence (JjJj., (J^) 2 ,...—> 0, 
then by definition 8((J*) n ) -> 0 for n-> +oo. Use Theorem 4.5 of 
Chapter II to derive 

f/ x t \ _ ii m 4*( x > * (At) n ) — */f(x, t) 

J\ x > t) — iim 

almost everywhere, where (J*)/, (J£) 2 ',... is a subsequence of (At\ , 
(J*) 2 ,... . Since by continuity 

4*t(*> 0 = [ Re, M*> t + e i(dt)n) + Urn if, t (x 9 1 + 6 2 (Aty n ) 1, 

7t-»0Q 

the result ifj t = f follows. 

3*2* Apply ParsevaPs formula in Theorem 4.6 of Chapter I to derive 
(AB) ik = <A% | Be k ) = £ | e,Xe, \ Be k ) = £ AA* . 

3*4* Employ the identity 

^(i/*)H 0 («+JO V ( t + At , 0) _ e U/H)H„t 0))^(0) 

_ Ji/fi)i{ 0 (t--At) U(t + At, 0) — U{t, 0) \p/Q\ 

At K ’ 

+ _ „«/*>*.<) [/(*, 0) ^(0), 
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Hints and Solutions to Exercises 


and consult the methods employed in proving Theorem 3.2 to show that 
in the limit when At 0 the above vector-valued function of t converges 
strongly to 

* H ( t ) 0) ^(o)) + I h 0 U(t, 0) 9(0) 

= - J o) ip(Q^ 

= -\ V(t) e - {i/ * )H <> i 9(t) = -1 H#) 9(t). 

3.5. Write R n ( A, £) = (iht) k lk\ . According to Theorems 

2.6 and 2.7, 

|| * n (J, O/ll 2 = </1 Rn*(A, t) R n (A , 0/> = f I **(A, Ol 2 d<f | £?/>. 

*'[—a,+a] 

Since it is easily shown by using Taylor’s formula that (for fixed t) 

I -R,(A, 01 “* 0 uniformly in A e [—a, +a] when n —»- oo, it follows that 
11 ^, 0 / 11 ^ 0 . 

3*6* First note that [(H 0 t) n i/f](r) vanishes outside B 0 for all t e U 1 . 
Hence, if the expansion were correct and the series were convergent in 
some sense, then we would have e~ {i t n)H ° l ifs vanish outside B 0 at all times t . 
Since e~ {i / n)Hot is unitary, this would imply that 

(• ^ W|2 dr = f ^ r j|t dr = j 

J B 0 J R 3 

for all t e U 1 . 

4*1 ♦ For a vector of the form/ x ® ••• ®f n we have 

IM a ®-® 4„)(A ® - ®/„)|| 

= 11 A/lllr -II AJnWn 
< Mi IIIIA lli -MJIII/, II, 

= Mi II-II -4,1! IIA 0 -0/JI- 

Extend the above inequality to ® • • • ® M' n by using an orthonormal 

basis of vectors ej** ® ••• ® 0”’ (see Chapter II, Theorem 6.10). 

4.2. Use Theorem 6.10 in Chapter II, and take advantage of the 
manner in which and are constructed from the vectors 

(4.5) and (4.6), respectively. 
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4 3* Note that 

( Z fh ® "• ®/<„ I Z w ( k l k n) SiCi ® - ® 

x «,.«„) 1 (&!.fc„> ' 

= X *(*1 K) ( £ </h I ^>1 - <A I £*„>„) = 0, 

<*!.*»> «!.«»> 

since the sum by which each nfa k n ) is multiplied on the right-hand 
side has the same value for each {k x & n ). 

4*5* Resort to Definition 4.1. 

4*9* Show that U x ® ••• ® U n is isometric on ^ ® a ••• ® a ^/, 
due to the isometry of U 1 ,..., U n . To prove that the range of 
® * * * ® U n is ^ , note that ® a • • • ® a ^ n ' is mapped onto 
®a *'* ® a ^n W > and that the algebraic tensor product is dense in 
the Hilbert tensor product. 

5.1 ♦ Prove first the statement for L 2 (IR n , \x) by following the procedure 
used in proving Theorem 5.6 of Chapter II; the only difference in the 
present case is that, since the boundary of an interval is not necessarily 
of ^-measure zero, the first part of the proof should be carried out not 
only for semiclosed finite intervals of the form (5.28) in Chapter II, but 
also for any finite closed interval (take all combinations of closed and 
open end points). Then prove the statement for any L 2 (U n > /jl) by using the 
definition of symmetric and antisymmetric tensor products. 

5.3. Let U v be the unitary transformation of 3$ v onto L 2 (U n *, fju v ) 
which is such that A' r (v) = where A n r v) is the multiplication 

operator in Definition 5.2. Then U = ZJ 1 ® ••• ® U m (seeExercise 4.9) 
is a unitary transformation of ® • • • ® onto 

L 2 (U n \n i)®-®L 2 (r-, ^), 

having the property that A' r {v) = UA^JJ- 1 , where 

X {v \4>? ® - ® *, w ® - ® 0£>) 

= ^ ® ••• ® (XVi) ® ® < a ! c ) . 

According to Theorem 6.9 of Chapter II, there is a unitary transformation 
0 of L 2 ((R n s ^i) ® ••• ®L 2 ([R n ™, fji m ) onto L 2 (R n i+---+ n m f ^ x ••• X /x m ) 
such that ® • • • ® ift^ is mapped into 

*1“ - x - x tx m ). 
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Hints and Solutions to Exercises 


The operator (OU) Aj. u) (OU)- 1 is the canonical form of A* v) in 
L 2 (M n i+-+ n ™ } p ± x ••• X n m ). 

5 A. Apply Theorem 2.7 to answer the second part of the exercise. 

5*5* Use the spectral theorem, and in particular (6.24) of Chapter III, 
to find the domains of AE(R) and E(R)A y and afterwards make use of 
Theorem 2.7 to derive that <g | AE{R)f ) = <g | E(R)Af ) for all 
/ g Of A and g g 

63. Note that it is sufficient to prove the statement for t Q = 0. Using 
Theorems 2.2 and 2.4, one derives (cf. (3.11)) 

II (e iAt ~ l)/ll 2 = f I * At ~ 1 \ 2 d<f\Eff\ /GJT, 

Jos 1 

where | e ixt — 1 | converges pointwiseto zero when t -> 0 and is bounded 
by 2. Hence, Lebesgue’s bounded convergence theorem (Lemma 3.1) 
can be applied. 

6*4* Note that since [/*([/* 0 ) -1 = it is sufficient to prove the 

statement for £ 0 = 0. Use the decomposition (6.1) of Chapter III, to 
establish that 

||([/‘ - 1)/||* < lim£ | - 1 | 2 ||(£ a , - E^)ff < e 2 ll/ll 2 

k 

if t is chosen sufficiently small so that | e iil x — 1 | ^ e for all (jl g [0, 27r]. 
Hence, || U l — 1 || ^ e for all such t. 

6.5. Show that £ x commutes with E n by proving that ZJ{2tt) 
commutes with E n . Then check that all the requirements of Definition 
5.5 in Chapter III are satisfied. 

6.6. Approximate/(A) within [—n, «], within the limits of accuracy 
1/n, by a trigonometric polynomial of period 2 n. When n —> oo, ^(A), 
p 2 ( A),... converges to/(A). 

If/(A) = Xi(fy> where I is finite, note that X/(A) can be approximated 
pointwise arbitrarily well by continuous functions of compact support. 

6 *8. Choose a finite interval I. The function Xi( x ) p( x ) * s square 
integrable on (R n . Select a monotonically decreasing sequence of contin¬ 
uous functions h^x), h 2 (x),... converging pointwise to Xi ( x )• Then one 
obtains J | F(x)\ | h n { A) — X/(*)l I p(A)| dx 0, i.e., $F(x) xA x ) P( x ) dx = 0 
for any continuous function F(x) of compact support. Since #’®(R n ) is 
dense in L 2 ([R n ), one concludes that Xi( x ) p( x ) = 0 almost everywhere. 
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Consequently, if I 1 , I 2 are finite disjoint intervals and (J I n — U n , we 
have p(x) = Y n Xi n ( x ) p( x ) == 0 almost everywhere. 

6*9* Show first that (e iQv ift)(x) = e ixv iff(x) by proving that (2.3) is 
satisfied when n = 1 , A ± = Q, (E°(B) iff)(x) = Xb( x ) f*( x )- Then use the 
relation exp (iPu) = exjp(—iUp X QUpU) = Up 1 exp (—iQu) U F , to estab¬ 
lish that (e iPu i[f)(x) = ift(x + fiu). 

7 A* Observe that if <• | •>/ denotes the inner product in and 
<• | •>/' denotes the inner product in then 


. m n . m n 

(Z <// 1 */>/)( Z <// 1 &">/) = Z z <fi I A')/ <// 1 *'>/• 

7.2. Apply to 


7=1 


i=l 7=1 


Z <// I gt'W <// I &">/ 

ij=l 

the theorem on the absolute convergence of double series [Randolph, 
1968, Sec. 3-11, Theorem 3, p. 162], which states that Yi,j=i a n con- 
verges absolutely if and only if Yf=i a ij — K and ^ converge ab¬ 
solutely, and in that case Y?j =i a n ~ i ^ • 

7.3. Note that E*=i <A I gk)k = Z*-i <U k f k | U k g k ) k , and show 
that the range of U is ^ . 

7*4* Start by proving the assertions for appropriately symmetrized 
functions of the form ( 7 . 6 ), and then, using the fact that the set of such 
functions is dense in the considered subspace, extend the result to the 
entire subspace. 

7.5. Show that the spectral representation space L 2 (IR 4 , p) of 
{Q (x \ £? (2/) > Q (z) y S n } is also a spectral representation space of 
{P {x \ P (2/) , P (z \ S n } by showing that 


(P (a 5 * 0 ) (Pi > P2 > P3 > s ) Pl^iPl y P2 > P3 > ^)> 

with similar formulas for P (y) and P u) , where ijj is the Fourier-Plancherel 
transform of \fj at each fixed s = —ct,..., 

7*6* Obviously A*** D A*. If fe @f A *** and /* — ^4***/, we must 
have </* | g) = < /1 A**g) for all g e @ A **, and therefore also for all 
g e B a . Hence, fe @ A *. 
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7.7. Compute first 

{iJhyi * exp f~W^X r i p i + - + r»P« - MV + ••• + r„ 2 ))] dv x - dr n 
= ex P[~(l/2£)(Pi 2 + - + p» 2 )]. 

Then observe that the Fourier transform of P(r x r n )/ 0 (r x r n ) is 

?(-(#!.-07w/ o (Pi,..., p n ) in case that/ 0 is continuous and of 

faster than polynomial decrease at infinity (these conditions are obviously 
satisfied by/ 0 = exp[(-l/2^)(r x 2 +-b r m 2 )]). 

7.8. Introduce the new variables r = r x — r 2 , p = r x + r 2 , and 
apply Theorem 3.13 of Chapter II to write 

Then treat the integration in r separately for r ^ R and r > R. 

1.9* Establish that 

I (—-4)* = u-Jtoa [l + Z 

fc=0 W->Q0 L fc=l J 

converges uniformly, and that 


(1 + A) t (-A? = (J (-^)(1 +A) = 1. 

/c=0 N fc=0 

8.1. An infinite orthonormal sequence {e x , e 2 of vectors from M 
is bounded but it has no convergent subsequence since || e m — e n \\ = \/2 
for m 7 ^ n. 

8.2. If A 0 g then E A (I) ^ 0 for every interval containing A 0 . 
Since P^({A 0 }) — 0> this implies that every neighborhood contains a 
point of the spectrum. In fact, if some closed neighborhood 7 0 of A 0 does 
not contain any points of the spectrum besides A 0 , then for every 
A g/ 0 we could find a subinterval I A of I 0 , such that E A (I A ) = 0. Use 
Borers covering theorem to establish the existence of a finite number 
I A / A of such intervals, with the property that J 0 = / A U ••• U I* , 
and, consequently, E A (I 0 ) ^ E a (I A i ) + ••• + E A (I A J = 0. 
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83* If A x , A 2 are the eigenvalues of A, it follows from the spectral 
theorem (see Chapter III, Theorem 6.4) that 

A = i KE A m). 

k =1 

8.5. Assume that the Borel set B 0 of negative numbers with 
A 0 = sup{A : A e /i 0 } < 0 is a subset of the spectrum of A. Then 
E A (B 0 ) ^ 0, i.e., E A (B 0 ) f 0 = f 0 for some f 0 ^ 0. Show that 

</1 4f> = f A d II £//ll 2 < A 0 f d\\ E x A f || 2 < A 0 ||/||» < 0. 

B 0 " Bq 

8.6. Build an orthonormal basis in by selecting orthonormal bases 
in £([0, oo))jT and E(( — oo, 0))J^ and taking their union. Note that the 
unconditional convergence of the series in (8.5) implies its absolute 
convergence, from which the existence of Tr(^4 (+) + A { ~ ] ) can be 
deduced. 

8.7. Apply the results of Exercise 5.10 in Chapter III to {^4 n I? n } 
and then to {{A n B n ) C n }. 


CHAPTER V 


1.1. In deriving |r(£)|— t | v ex | —> r ex v ex /| v ex |, note that if one 
sets|r(£)| = | r ex + v ex £ | + i 7 ex (£), then r} e *(t) -> 0. In fact, if | t | is 
large enough, 


i m 


1 1 v ex I = ri^(t) + (| r ex I 2 + 2*r ex v ex + t 2 1 v ex | 2 ) 1/2 — t 

2 r ex v ex | j* ex I 2 x 1 / 2 


v ex I 


: V ex(t) 


t I v <! 


[(*+?■ 


I v ex I 2 

= 7l ex (^) + t I V ex I (- (— —-4 _- 

V U-h I 1 \2 \t I v ex I 2 ^ t 2 I v ex I 2 


r t 2 | v ex | 2 
1 | r ex | 2 


■) -•] 

)M±)) 


r ex v ex 

for ^Too. 

1.2. Recall that for the given potential the general solution for a 
r(£) which is not a bound state is a hyperbola or parabola. If r(t Q ) = r 0 is 
the minimum of r(t) = | r(*)|, then drjdt 0 for t t 0 . Hence, r(t) 
has an inverse in (— oo, £ 0 ] and in [£ 0 , -foo). Since 


$m 0 v 2 + 1 \r — \m Q (t 2 + r 2 c/> 2 ) + 1 /r = const and r 2 (j> = const, 
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we have 



where b and c are constants. Consequently, setting 

r — —= (u -|-^- 7 =) and a = Vi + be 2 ’ 

Vb \ 2 Vb> 4 

we obtain 

dr 

JX U75 

\ m 0 \ r r 2 1 

= - « 2 ) i/2 + y - w -‘h 1 «+ v “ 2 - 1]“ •• 

Using the above result and recalling that r(t) -> + oo when £ —> =p oo, 
we get r{t)jt -> (26/mo) 1 / 2 , so that | v ex | 2 = 2 b/m 0 according to Exercise 
1.1. Show that r(t) — t | v ex | = r(t) — ^(26/mo) 1 / 2 diverges, instead 
of converging to r ex v ex /| v ex |, which should happen (according to 
Exercise 1.1) if asymptotic states exist in the sense of (1.5) and (1.6). 

13* Use Fubini’s theorem to derive from < K*g | /) = | Kfy 

that K* is an integral operator of the above form. 

I A* It follows from Theorem 1.1 that ¥^(0) = Q + W in (0) and 
¥7(0) = Q_W <>^(0). Show that Q_*W{0) = JP ont (Q) = V^O) 
[see (2.10)]. This implies that ¥*"*(0) = = sxp H 0). 

1.5* The computation will be the same as in the case treated in the 
text, except that the integration in P cannot be eliminated in (1.38), but 
rather must be retained to the end of the calculation. 

1*7* Note that 

<m 1 w 1 Av 0 (t)> 

= <y(o - v 0 (t) | AV(ty> + <y o (0 I Am - y o 0)>- 

1*8* Note that E {V) (B) commutes with H^\ and consequently 

<V™{t) I E (V) (B) = <1P«(0) | exp(iH^t) E^(B) exp (-iH^t) V ex (0)) 

= <W ex (0) I E"»(B) ^(0)> 

for any B e 88 z \ then insert A = E»\B) and W 0 (t) = exp(—iH ( ^t)W ex (0) 
in Exercise 1.7. 
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1*9* Consider the case of $ € (k) = A e (k)0(k), where (cf. p. 218) 
^(k) = exp (-c-g-*) 

Taking inverse Fourier transforms of both sides of the first relation 
appearing in this exercise leads to 


(exp (-iH 0 t) 0 e )(r) = (2 t t)~^ 2 f exp{i[kr - (k 2 l2m)t]}fc(k) dk 


= (2tt)- 3 dk exp{i[kr - (k 2 l2m)t]} * € (k) exp(-ikr') 0(0 

•'dS 3 *V 3 

Since the above integrand is integrable on U 6 in (k, r), Fubini’s theorem 
can be applied: 

(exp(-*H 0 0 0 € )(r) = (2 tt)- 3 [ *'0(0 [ dk exp{*[k(r - r') - (k 2 l2m)t]}h£k). 

V *V 3 

After integrating in the (0, <f>) spherical coordinates of k, thus arriving at 
An r exp[— (e + iWl2m)t] am k 1 r ~ f 1 k dk 


2iTtn 
€ -|— i 



e -|- i 

2m 


k 2 t + a | r — 



and then letting e —> +0, one obtains the desired relation. In fact, 
the limit on the right-hand side of the preceding relation can be taken 
under the integral sign by applying Lebesgue’s bounded convergence 
theorem (Chapter IV, Lemma 3.1) to infer that for 0eL 1 (R 3 ) nL 2 (R 3 ) 


Hmf |0(r')| f exp{i[k(r - r') - (k 2 l2m)t]} | 1 - h € (k)\ dk 

•'(R 3 • / o $ 3 


dr' = 0. 


On the left-hand side, the continuity of the operator e~ iHot implies that 


s-lim 0. 

e ->+0 T€ T 


L10* Since it is quite obvious that C/{ 1) and U j 2) map L 2 ((R 3 ) onto 
itself, it is only necessary to check that these operators are isometries; 
for example, setting p — mrjt, one gets 

|| || 2 = (m/tf f | famr/t) |« dr = f | <ftp)| 2 d 9 = U || 2 . 

J R 8 J D5 3 
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Apply C7j L) t/j 2) to integrable functions </>(r), eZ, 2 (IR 3 ), and use the 

result of Exercise 1.9 to write that almost everywhere in R 3 

(UFuPfflr) = exp[im(r 2 j2t)]( t/< 2) 0)~(mr/O 

= {mlitf 12 exp[ im(r 2 l2t)] 1 /(2tt) 3 / 2 f exp[-t(mr/0 r'] 

J R 3 

X exp[*7«(r' 2 /2J)] 0(r') </r' 

= ( m/lnit )* 12 f exp[/(m/2^)(r — r') 2 ] 0(r') </r' 

= (exp( —iH 0 t)i/i)(r). 

Since the family of all such functions is dense in L 2 (IR 3 ), and since both 
e -iH 0 t an( j jja)jj(2) are b oun( i e( i operators, the above relation can be 
extended by operator continuity to all ofL 2 (R 3 ). 

ML Note that due to the unitarity of U\ 1] 

IKexp (-iH 0 t) - || 2 = || - 1)0 1 | 2 

= ||(C/< 2) -l)0|| 2 

= f | exp[*7«(r 2 /2j)] — 1 | 2 1 0(r)| 2 dr. 

J R 3 

When t —► =f oo, the above integral converges to zero by Lebesgue’s 
bounded convergence theorem (Chapter IV, Lemma 3.1). 

1*12* By taking in Exercise 1.7 ^ o (0 ~ e~ iHot ift, W(t) = and 

A equal to the projector on C (+) , i.e., (Aifs)(r) = Xc<+>( r ) *A( r )> one gets 

lim [ f 0)(r)| 2 dr - f \(U^)(r)\ 2 dr] = 0 

f->T°oU c (+) J c (+) J 

The substitutions k = ^fmrlt yield 

f \(U^)(r)fdr = {mjtf f 1 fomr/t)? dr = f |0(k)| 2 rfk. 

J C (+) J C (+) J C (T) 

2A ♦ Take a bounded interval 7C[—a, -fa], and approximate 
Xi( A) in the mean on [—a, +a] by a trigonometric polynomial p n ( A) = 
£fe=-» exp[(7r^/a)A], If 

n 

Pn{Aj) — ^ a k ex p[^j( 7r ^/ a )]j 

k——n 
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one easily derives p n {A^)C = Cp n (A 2 ). Since by Theorems 2.5-2.7 
in Chapter IV p n (A-) —*■ E A i(I u / a ), I a = {A + 2aj : Xel, j = ±1, 
±2, ...}; letting ol —> oo so that E a *(I 2 ) —> 0, one obtains E Al {I}C = 
CE A *(I). This result can be easily extended from finite intervals / to 
arbitrary Borel sets Eg# (see Chapter III, Theorem 5.6). 

2*2* Use the method employed in proving Theorem 4.1 of 
Chapter III. 

2.3♦ Prove that (1 — E)f = 0 by establishing that 

</!/> = <£/1 £/> + <(1 - E)f | (1 - £)/>. 

2.4. Use (2.16), (2.20), and Theorem 2.7 of Chapter IV. 

2.5. Note that the bounded operators A N = J[~ N+N ]hdE A leave 

M„ = M n E A ([—N, invariant. Hence, any polynomial 

p(A N ) leaves invariant and therefore E An (B) = E A (B\ B C [—TV, 
+iV], leaves M N invariant. Since -> M when iV— > oo we get 

E A (B)fe M for fe M, and E A (B)g _j_ M for g J_ M, i.e., E n E A lB) = 
£„ . Consequently, E A {B)E n = £„* = E M E A (B). 

2.6. Note that E A commutes with e iAt and therefore 

f A 2 d || Efe iAt ff = f A 2 d || e iAt Eff || 2 = f A 2 rf|| Eff\\\ 

J R 1 J u 1 J r 

3.1. The argument runs along the same lines as the proof of the 
corresponding statements about. Q_ e and Q +e . 

3.2. Note that 

ii(o* - a*w)/n a = - f 1 mf> dt 

J s 

< ||(fl, - fl^))/|| « fe-^dt. 

J S 

3.3. Use Definitions 3.1-3.5 of Chapter II. 

3.4. Note that the linearity property is shared by every Riemann- 
Stieltjes sum in the sequences whose limits define these integrals. 

3.5. Establish these properties for the corresponding Riemann- 
Stieltjes sums in (3.18) and (3.20), and take the limit. 

3.6. Prove first that if F( A) and F( A, A 0 ) are nonnegative, then the 
corresponding Riemann-Stieltjes sums are nonnegative, and therefore 
the corresponding integrals are also nonnegative. Then set F = F x — F 2 
and apply the linearity property (see Exercise 3.4). 
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3*7* Prove the statement first for proper Riemann-Stieltjes integrals 
by establishing it for Riemann-Stieltjes sums and then taking the limit; 
afterwards, generalize the result to improper Riemann-Stieltjes integrals. 

3*8* If m ^ .F(A) ^ M on [a, ft], then using the result of Exercise 3.6, 
one can derive 

m f b d x a( A) < f b F( A) d x a( A) < M f & d x a{ A). 

J a J a J a 

Consider the nontrivial case when o{a) < a(b). If m and M are the 
minimum and the maximum of .F(A) in [a, ft], respectively, then F( A) 
assumes all the values between m and M on [a, b ]; hence, there is a point 
A' at which 

F(X)=jy(\)d x o(\)/j\ow- 

For the existence part, prove that the limit (3.17) exists and is inde¬ 
pendent of the chosen sequence of subdivisions {A (fc) } by showing that 
if {A (Aj,l) } is a finer subdivision than {A (fc) } [with A (fc * 0) = A (fc) ], then 

£F(A' (fc) ) o((A (fc “ 1) , A< fc >]) - £F(A'< fc ’*>) o((A< fc - l >])| 

k k,l ' 

can be made arbitrarily small for all sufficiently fine subdivisions 
{A (fc) } [here X (k > l) denotes the point preceding in the subdivision 

{A (fc » l) }]. To obtain this result, exploit the uniform continuity of .F(A) 
in [a, ft], according to which for given e > 0 we have 

|F(A' (fc) ) — F(A' (/M) )| < € 

for all sufficiently fine subdivisions {A {k) }. Hence, the above difference is 
smaller than 

£ | F(A' {k) ) -F( A'< fc * J >)| A<*•«]) < €<r((a, ft]). 

k,l 

3*9* Note that 

</1 A R *g> = <A R f | *> = f ' <A(f)f | £> MO = f </1 A*(Og> MO 

Jr j r 

for all f,ge 

3*10* Note that (g | e iu ~ l)t /) is continuous and therefore Borel 
measurable in t , and that 

/•+« -00 

II e« A -n*f II dt < II/II e tlm ‘ dt < +oo 

J o o 
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for Im £ < 0. Hence, e t(A t )l is Bochner integrable. Using the spectral 
theorem, we obtain for any g e 

<g \Ra(0/> = I E £f> 

.+00 

e Wi-C)t fa 

0 

= -1 f dt f ««*-«* dig I Effy 

J o •'r 

= -i C <g I e iM ~ £)i /> dt, 

J 0 

where the order of integration could be reversed by Fubini’s theorem. 

4.1. Use Fubini’s theorem to carry out the integration in (4.41) 
successively, first with respect to +** and then with respect to fi^K 
In view of the definition of /x<“> in (4.40), we obtain the desired result by 
applying Theorem 3.11 of Chapter II with R n = {(/, m)}. 

4.2. Let B + and B_ be the Borel sets in R 2m on which G a) (x, x'\ £) — 
G i2) (x, x'\ t) is positive or negative, respectively. For any Borel sets 
■®l > ^2 e of finite measure we get, using Fubini’s theorem, 

| g Xb^ = J# x ^ x ’ x ’ X /x), i = 1,2. 

Hence, if B x x B 2 C B ± , we infer (see Chapter II, Lemma 4.1) from 

f (G^(x, x'; 0 - G< 2) (x, x'; 0) d(fx X /x) = 0 
j b 1 xb 2 

that B x X B 2 is of yu. X /u. measure zero. Since by the definition in 
Theorem 2.4, Chapter II, of products of measure, we get from Theorem 
1.7 and (2.26) of Chapter II 

(ft X /x)(B±) = sup | £ Ox X ix)(B[ k) X B (k) ): £ B[ k) x , 

we thus deduce that (/u. X H)(B ± ) = 0. 

4.3. Note that 

I 'Ao*( 1 ‘)(G r o(r, r'; A -\-rq) — G^{t, r'; A)) F(r') </f(k)| 

< M B (r) | «A 0 (r)| | | I $(k)|, supp $CB, 


] <*<£ I ^/> 
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where the function on the left-hand side of the above inequality con¬ 
verges pointwise to zero when rj —> ±0, while the function on the right- 
hand side is integrable on IR 9 in r, r' and k. The desired result is then 
obtained by applying Lemma 3.1 of Chapter IV. 


4*4* If Xj(y) denotes the characteristic function of /l -1 ((A^_ 1 , AJ) 
then 





g*(ym I tfiFW)/) 


= f t I *W)/> xAy) My)- 

J B5 n j== i 


The integrand on the right-hand side of this equality converges pointwise 
to the integrand on the right-hand side of (4.84). Lebesgue’s dominated 
convergence theorem can be applied since on supp g(y), || H 1 x 0 y || < 
Cj , and consequently, 

| <<M /W/)/>l < 11 h^a II F(A/)/|| <CjC||/||. 

4*5* Write (4.103) in terms of matrix elements with/, g e C Qj v , 

<£ I ^(£)/> = <g I vf> + <Vg I (£ — ^o)" 1 *KQ/> 

and express both sides of this equation in the momentum representa¬ 
tion, using (4.82): 

f *!*(kX0k |*w> = f *r(k)(«k I Vf> 

J D5 3 J D5 3 

+ f dk\Vg I & k *)[i - k" 2 /2| 

J B8 3 

Since V(r)@ k (r) eL( 2 )(IR 3 ) if F(r) eL( 2 )(IR 3 ), we have 

(<Pk I Vf> = <V0 k l/> = J <V0 k I 0 k .) dk’(&r I/>, 
with a similar result holding for <F^ | <P k ), and also 

I MQ /> = <[1 + FG*(Q]F<V |/> 

= /<[! + FG*(Q]F<P k . | 0 k ') dk’(<P k . | />. 
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Inserting these results in the preceding equation, and noting that 

<V0 k I <*v) = (& k \V\ <ZV) = <k I F | k'>, 

[1 + FG*(£)]F<Zv | 0 r ) = (<Z> k * | V + VGtf)V | 0 k O = <k" | ^(0 | *'>, 

we arrive at the result that the left and right hand sides of (4.105) 
multiplied by g*(k)/(k') and integrated over U 6 have to be equal at 
least for all /, g e ^ b °(IR 3 ), and therefore (by the same argument as in 
Exercise 4.2), that (4.105) has to be satisfied almost everywhere in U G . 

4*6* Write (4.113) in terms of matrix elements for /, geL 2 (U s ), 
use (4.96), and apply the argument of Exercise 4.2. 

5A* The function/(r, r') = | r — r' | is continuous and therefore 
Borel measurable in (R 6 (Chapter II, Theorem 3.1). Since D Gq = / _1 ({0}), 
it follows that D Gq is a Borel set. To establish that its Lebesgue measure 
fJLf(D Gq ) is zero, note that 


/4W = f dv dv\ 

and write the above integral as an iterated integral by applying Fubini’s 
theorem. 

5*2 * Since K(oc , a') as well as /(a) g(oc') are square integrable in U 2n 
with respect to /jl X fi, the function K(oc , a')/(a:) g(oc') is integrable in 
IR 2n . Hence, Fubini’s theorem can be applied in the following derivation: 

</ \ Kg) = f <W«)/*(«) f dpi*’) K(a, «')g(oc') 

J R n J R n 

= f f^)K(a,a)g(a)d^((x)dfx(oc') 

J R 2 n 

= f «')/(«))* = <K*f\g), 

J J l5 n 

which shows that K * has the kernel K*(oc', a). 

For arbitrary feL 2 (U n , p), the function 

f n I « W) I <W«') 

J D 5 n 

< [J m „ I K(a", a ')| 2 ^')] 1/2 [J M „ |/MI 2 ^(“)] 1/2 
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is ^,-square integrable in oc' e U n , and consequently 

f d K a l\ «)| f dp(oc')\ K(ol \ oc')f(oc') I 

J R n J R n 

exists. Thus, we can apply Tonelli’s and Fubini’s theorems in succession 
to infer that 

f d^oc") K*(oc", a) f d^oc') K(oc", oc')f(oc') 

J R n J R n 

= f d^oc') /(ex') f di4*') K*(oc", oc) K(pc'\ oc f ). 

J R n J R n 


53. According to Theorem 8.6 of Chapter IV, K is a Hilbert- 
Schmidt operator if and only if K*K is of trace class. In order to 
establish that K*K is of trace class, it is sufficient to establish that 

Tr (K*K) = £ | K*K^y = £ || Rf, t || 2 < + «> 

fc=l fc=l 

is some orthonormal basis , ift 2 

In view of the fact that the function K(oc , oc') is an element of 
L( 2 )(U 2n , fj, X I *) and that {fa ® h k = 1,2,...} is an orthonormal 
basis in L 2 (R 2n , /jl x /x) (see Chapter II, Theorems 6.9 and 6.10) 
we obtain 

f | K(oc , oc ')| 2 d^d^oc') 

J |jj2n 

= £ I f '/'**(“) ^(“') I 

<.*=i I J k 2 ” 

-f (£|J 


i//x(a) 0i*(a) dn{oc) K(oc, oc') fa(oc') j 


I f>(«) 

fc-1 J D5 n 


K{* , «') <//*(«') = X || *0* || 2 = Tr(iC*^). 


In deriving the above equation we have used in the first step Parseval’s 
relation in L 2 ((R 2n , fi x (jl ), in the second step the unconditional conver¬ 
gence of a double series with positive terms, as well as Fubini’s theorem, 
and in the third step Parseval’s relation in L 2 (IR W , /jl ). 

5*4* Apply the Riesz-Fisher theorem (Chapter II, Theorem 4.4) to 
conclude that K(oc , oc') is square integrable on IR 2m . 
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5*5* The operator BA is completely continuous by Chapter IV, 
Theorem 8.1. Suppose that A = A* and that {e x , e 2 ,...} is an ortho¬ 
normal basis for which Ae k = X k e k . Then 

ZII BAe » II 2 = E VII Be k |P ^ || B |P £ < +oo, 

k k k 

which implies that BA is a Hilbert-Schmidt operator. 

In general, if A ^ A*, then we can write A = A x + i^ 2 , where 
A 1 = A x * and A 2 = A 2 *. Then BA is a Hilbert-Schmidt operator 
since BA X and BA 2 are Hilbert-Schmidt operators by the above, and 
a family of Hilbert-Schmidt operators is a linear (normed) space (see 
Theorem 8.10 in Chapter IV). 

5*6. Note that for any/e^f 

• m II m 

E AJ < I II A k || ll/H 

k=n +1 k=n +1 

can be made arbitrarily small for sufficiently large n and all m > n. 

Hence, 

A f = s ‘ h l m E A *f 

n " +co ti 

exists for all / e Jf, and obviously defines a linear operator on 
Moreover, since by letting below n —► + oo we see that 

M/ll < if A k f\+\ f A k f\\^ f||A II ll/ll, 

fc=l k=n+ 1 k= 1 

we conclude that A is bounded, and the desired estimate is obtained for 
|| A ||. By the same procedure we get 

|M-ea|=i e a|< e iiaii-o, 

fc=l fc=»n+1 ' k=n +1 

when n -> + oo. 

5.7. Note that 

^i^L|| + |p <MIP 

and SIM II* < +oo. 
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5*8* Since m A is a lower bound of S A , 

M-oo --+-00 

</1 Af>= Xd\\ Efff > m A f d\\ Eff || 2 = 

J m A J m A 

Conversely, if </| Afy ^ y ||/|| 2 , then y must be a lower bound of 
S A . In fact, if that were not so, then wp would have A 0 e S A for some 
Xq < y. This means that we could find for € 0 = ^(y — A 0 ) a nonzero 
vector / 0 e Of A such that ^([A 0 — € 0 , A 0 + € 0 ]) f 0 —f 0 . Consequently, 
we would have 

</« 14/o> = f Arf </ 0 1 Eff 0 y 

^o~ € o 

< (K + *«)</„ I E'Wo -e 0 ,x 0 + £o ]/o> < rll/oII 2 - 

which contradicts the initial assumption that </ 0 | Af o y ^ y II /oil 2 - 


6*L Introduce spherical coordinates (r', 0, <j>), orienting the # axis 
along the vector r. Then 


J. 

J r'^R x 


dr' 

| r' | 2+e o| r - r' I 


= 277 



sin 0 

(r') e o[(r') 2 — 2 it' cos 0 + r 2 ] 1/2 



[r 2 + 2rr f + (r') 2 ] 1/2 - [r 2 - 2rr' + (r') 2 ] 1/2 , , 

r(r') 1+e o dT 



dr' 

r(r') € o 



dr ' 

(r') 1+e o 


477 




l + _ 

r ^ (1 


477 


e o) e o 


r € o 


°(“r) if 


6*2* Let R 0 be such that r 2_Hr ° | F(r)| < C 0 for all r > R 0 . Divide 
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the domain of integration into two parts, so that 

1^21 *. + c. f 


dr' 


' 0T I 

and note that 


>Rq I *' l 2+e °l *' ~ r I 


r — r Ur' 


dr' l 1 ' 2 


1“ ^ Ur-o, Ir-r'| 2 


it r a /2 

jj J V(r')\*dr'\ . 


Constants C ± and C 2 can be chosen which are such that (see Exercise 

6 . 1 ) 

r dr' c dr' 

« r 


r'l* ^ Ci ’ L 


> r 0 I r' P +E o I r' — r | ^ Ci! 


•'/<«„ I r - r' | 2 

for all r e R 3 . The continuity can be proven as in Exercise 6.9. 


6*3* Since V(r) is square integrable for r x ^ R 0 , we obtain (by using 
the Schwarz-Cauchy inequality) for r > R 0 




dr i 




|r 


1/2 


= const 




here r av , r av ^ R 0 , is a vector (whose existence follows from the 
mean-value theorem of the integral calculus) for which | r — r x |~ 4 
assumes its average when integrated in r x over the sphere {r x : r x ^ R 0 }. 

Since | F(r x )| ^ C 0 rf 2 “ e ° for R 0 , we easily carry out the following 
computation by orienting the z x axis in the direction of r: 


J 


I ^(rJI 

| r — r x | 2 


dr i ^ 



sin 0 

r*o(r 2 — 2 rr 1 cos 0 + r*) 


</0 


= <r 
r 


1 

r l+f 0 


In 



* c o f +a> 
^ 1+e ° J*o/r 



It is immediately seen that the above integral is O(l/r 1+eo ) when 
0 < € 0 < 1. For € 0 ^ 1, the integral on the right-hand side of the 
above equalities does not converge when the lower limit R 0 /r is replaced 
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by zero. However, then 







To prove the local integrability in r of the function under con¬ 
sideration, note that the function | F(r x )| | r — r x |“ 2 is square integrable 
on the set {(r, r x ): r < R < +oo, r x e R 3 }; this follows (by Tonelli's 
theorem, Chapter II, Theorem 3.14) from the existence of the iterated 
integral 


/ *J F(rJ| J 


dr 


r^R 


r, | 2 


Hence, the desired result follows by Fubini's theorem. 


6*4* By using the Schwarz-Cauchy inequality we obtain 

f I V{rj\ dr 2 | r \ V(r 2 )\ dr 2 p r V(r 2 ) dr 2 ^ 2 

Jr 3 I r — r 2 1 | r 2 — r x | ^ | r - r 2 | 2 i IJ I r 2 — r x | 2 j 


Thus, the present case is reduced to the case considered in Exercise 6.3. 


6*5* Combine the results of Exercises 6.2 and 6.3; note also that V(r) 
is square integrable. 

6*6* Since V(r) is locally square integrable it is measurable on M 3 
(see Chapter II, Theorem 4.1). Since | r — r' | —2 V(r) satisfies 
| r — r' | —2 | F(r)| < C 0 | r — r' \- 2 r~ 2 ~^ for r > R 0 , it is integrable 
on the set {r: r ^ i? 0 } due to the integrability of | r — r' \~ 2 r~ 2 ~ e ° on 
that set (see Chapter II, Theorem 3.9). By Fubini’s theorem the function 

f JF(r)||r-r' \~ 2 dr 

J DT 

is locally integrable in r', and majorized by const | r' |- 1_e ° for | r' | ^ R 
(see Exercise 6.3). Hence, it is integrable on the set {r' : r ^ Z? 0 }. 

The integrability of | F(r)| | r — r' | —2 | V{r')\ on U 6 now follows 
by Tonelli's theorem from the existence of the iterated integral. 
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6.7. According to (6.29) 

K«| ^ F v (r; k 2 /2m). 

Use the result of Exercise 6.5 and the continuity of d(k 2 j2m) to prove 
that F v ( r; k 2 /2m) < const for all r > R 0 and all k e D 0 . 


6.8. By the mean-value theorem of differential calculus 
AI _ _ 

A " ' M s 


Au 


= / a ^ r ’ u + Au )~ H r > ")]/( r ) dr 


= f h u (v, u + 6 Au)f( r) dr, 0 < 8 ^ 1. 

J M 3 

For given e > 0, choose R ± > *0 so that 

I f K( r > ")/(*■) * I < f p(*) dr < c/2. 

1 J r^R, I J r^R 1 


Since h u { r, w) is uniformly continuous over the region {(r, u + Au): 
r < R x , | | < const}, we can choose Au so small that 

f | h u (r, u + SAu) — h u (r, u)\ |/(r)| dr < e/2. 

^ r^R 1 

Hence, we have 


AI 

Au 


- f , h u(r, u)f(r) dr 

J ICT 


< € 


for sufficiently small \Au\. 


6.9. For given e > 0, choose a so small that 


exp (ik 1 1 *! — r' |) 


Hr') dr' <f -rj ffi-K. dr’ 

•'|r'-r|<a I r l r I 


|r'-rl<a 


(f |r x -r'|-**') (J |/(r')|W) <* 

'•Mr — r|<a ' '*Mr'—rl<a ' 6 

for all r x from some closed neighborhood Jf of r. Then select R 0 > 0 
so large that for all those r x , 


l 


exp(t&| r x - r' |) 




/(r') dr' 


^ const f i r < |. 

Jr‘>xjr | 8 +‘o 6 


Since | r x — r' | 1 exp(^ | rj — r' |) is uniformly continuous for r x e 
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and r' e S Q = {r': | r' — r | ^ a, r' < 7? 0 }, we can obtain 



exp(*£| r — r' |) 
|r-r'| 


exp(^| f! - r' |) 
I r, - r' | 


1/(01 *' < 5 


for all sufficiently small values of | r x — r |. Combining these three 
results, we easily obtain | £(* 1 ) — £(r)| < e for such values of | r x — r |. 


6JO* Since is a closed set of measure zero, it is possible to 
construct a sequence A x (r), A 2 (r),... °f infinitly many times differentiable 
functions with supports disjoint from , which is such that J | h n { r) — 
1 | 2 dr —> 0. An example of such a sequence would consist of the functions 
h n {r) = 0 for r e \ , h n { r) = 1 for r ^ at a distance d{ r) > 1 jn 
from , and h n (r) = exp(— n 2 d 2 (r)[n 2 d 2 ( r) — 1] _1 ) for \ and 

d(r) < \\n. Since f n (r) = /(r) h n (r) e for any /(r) e ^(R 3 ), and 
U 3 ) is dense in L 2 (R 3 ), so is . 


7 Setting 


F{u) = f f(t) dt , u > 0, 

J 0 

and integrating by parts, we obtain 

I e~ €t f(t)dt= lim f e~ et f(t)dt 
J 0 a ^+00 J 0 


a-»+°o 
/•+« 


= lim ] e~ €t F(t) 


t =0 


f 

J n 


e 1 e~ et F(t ) dt 
' 0 


/•t<* 

= € e~ €t F[t)dt. 

J n 


This establishes the existence of I € for e > 0. We note that the above 
limit for oc —> + 00 exists since lim^ +00 F(t) = I 0 exists [and con¬ 
sequently lim^+oo e~ € 0 t F(oc) = 0], and since from the continuity of 

F(t) it follows that | F(£)| ^ C 0 for 0 ^ t < + 00, so that Jo°° e~ H F(t) dt 
exists. 

Using a method already employed in proving Theorem 3.1, we write 

I r+°° r +co I 

| I e — I Q | = e e~ €t F(t) dt — e e~ et I 0 dt 
€ I J 0 0 1 

ir < f (t) - '•) * 1 < r 1 ■ f (t) -•'•> h 

For any given rj > 0, select oc > 0 so that oc < (>?/2)(C 0 -f- | / 0 |), 
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and consequently 



du < | « | (C 0 + | /|) < l. 


For such a fixed a > 0, choose e > 0 so small that I F(uU) — /. | < V I2 
for all u> a. Hence 




and consequently 


du 








r+r 

J 0 J a 


< 


+ 


= v- 


We note that the proof would be much more straightforward if 
1/(01 w ^re integrable on [0, —|— oo). However, in the present case we 
deal with an improper Riemann integral and not with a Lebesgue 
integral, and the convergence of the improper Riemann integral of/(*) 
does not imply the convergence of the improper Riemann integral of 
1 /( 01 - 


7.2. Note that since /(k) is integrable, 

(2^) 3/2 |(exp(—*7/ 0 ^)(r)| 

= I f exp[— i{t\s?j2m — kr)] /(k) dk I < f | /(k)| dk. 

1 V 1 V 


7.3. Using (6.58) and (6.38) we get (see also Excercise 6.5) 

f dm’ f dm | T™(k', to, to')| 2 
J a s J a a 

< const f dm' f dm | f | F(r')(l + F v ( r'; k 2 j2m)\ dr'l* 

Jq s Ja s 0,3 ) 

= 16tt 2 const | J | V(r’)\ (1 + F r (r'; k 2 j2m) dr'j . 

Hence, the square integrability of T a) (k; to, to') follows by Tonelli’s 
and Fubini's theorems. 
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1A+ Using (7.63) we get 
<?Wi I T(k)f*> 

= (l /4^K(S(k) - 1)A I (S(k) - l)/ 2 > 

i*2rr 

= (1/47T 2 ) I - 1) d((S(k) - 1)A I £#)A> 

J 0 

/•2rr 

- (1/4^) J (*“ - 1) d,i(S(k) - 1)A I (E^'(k) - E^k)) E x {k)U ? . 

J 0 

On the other hand, 

<iE^(k) - E h {k)) E y (k)U I ( S(k) - 1 )/ x > 

/•2rr 

= f («“ - i) <4 <(£a 2 -(*) - £a 2 (*)) M*)/* I Uk)h> = o 
J 0 

since [£ v (&) — E x <Jk)\ E x (k)f x = 0. 

7*5* The Legendre polynomials Pi(u) are the solutions of a Sturm- 
Liouville problem (see Chapter II, Theorem 7.5). Hence, the expansion 

i I (21 + 1) P t (u) f 1 P t (u) e*™ du 
Z 1=0 J -i 

is a Sturm-Liouville expansion, and as such it converges uniformly 
(see Titchmarsh [1962, Theorem 1.9]). To calculate the coefficients 
in this series, expand e iaU in a power series and show first that 

/*+i 

J u m P t (u) du 

is zero unless l + m is even. For the case of even l + m, the above 
integral can be computed by using standard relations obeyed by Legendre 
polynomials (see Butkov [1968, Chapter 9, in particular Problem 17]). 

7 *6* Note that as in the case of (7.2) 

(*-^ ro )(r) = (2 t t)~™ f exp[-(^ + it)(k^2m) + ik(r - r 0 )] dk 
* , os 3 


and carry out the integration in ke IR 3 . 
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7.7* Note that the required result has been established for ipl(r) = 
^r 0 (r) in Exercise 7.6. Now, according to Lemma 7.1, the set {<f> T : r 0 e R 3 } 
is dense in L 2 (R 3 ). Since || £%B) || < 1, we can apply Lemma 1.1 

to reach the desired conclusion for arbitrary <p 0 eL 2 (M s ). 

7.8. Since || if/ t — || -> 0 when t -*■ ^ oo and 

|| E°(B)+ t || < || & - <l>?\\ + || E q (B)^ x \\, 

the desired result follows from Exercise 7.7. 

8.1. (a) Four channels: (n, p, e), (n-p, e), (n-e, p), and (p-e, n), 
plus the bound states ( n-p-e ). 

(b) (n-e) has not been observed so that = {0} as far as 

present experimental knowledge goes. 

(c) (p—e) = hydrogen atom, ( p—n ) = deuteron, ( p—n—e ) = deuterium. 

8.2. (a) Eight channels: (p, p, n, n), (p, p, n-n), (n, n, p-p), (n, p, n-p), 

(n-p, n-p), (p-p, n—n), (n, n-p-p) and (p, p-n—n) plus the bound states 
(n-n-p-p). 

(b) ( p, p, n, n), all four particles are free; (n, p, n—p) = neutron, 
protron, and deuteron; (n-p, n-p) = two deuterons; ( p, p-n-n) = pro- 
tron and triton (p-n-n); (n, p-p-n) = neutron and the ionized helium 
isotope ( 3 He) 2 + of 3 He; (n — n— p — p) = alpha particle ( 4 He) 2 +. 

8.3. Since/(t) is continuous and has a limit at +oo, it is bounded; 
i- e -> 1/(01 < M for all 0 < t < +co. For given e < 0 choose Me) 
so that |/(f) — a | < e/2 for all t > N(e). 

Then 


1 r T 1 r r N ^ rT 

T) J(t)dt-a \m~ a \ d t + l N(e) \m-a\dt] 


<2 M^ + i. 


Hence, for all T ^ T^e) = 4Af[A^(e)/e], we have 


(t)J> 


dt 


< e. 


8.4. Use the relations (8.23) and (8.24). 
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8.5. Follow the line of argument employed in proving (8.42) and 
(8.43). 

8*6* Note that 

y 

since Qf*Qf = E„f , while Qf*Q^ = 0 for p # y due to the fact 
that the initial domain R+* of is orthogonal to the final domain 

Rjv> of Q+ ] . A similar argument leads to the conclusion that 


q(b)* S ’qM = 


8*7* Prove that 


(/\g) = S </\ u *g> dt 


is a bounded bilinear form, and apply the result of Exercise 2.5 in 
Chapter III. 
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square integrable, 101 
total variation of, 467 
Functional, see also Vector spaces 
bounded,182 
continuous, 182 
linear, 182 
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G 

Gelfand triples, see Hilbert spaces, 
rigged 

Green’s function, 501 
advanced,503 
free advanced, 524 
free retarded, 524 
full, 502, 528 
series expansion of, 529 
partial wave, 525 
retarded, 503 

symmetry properties of, 531 
Green’s operator, 501 
Ground state, 168 


H 

Hack’s theorem, 610 
Hahn’s theorem, 237 
Hamiltonian, 286 
of arrangement channel, 599 
eigenfunction expansion for, 500 
internal free, 442 
time-dependent, 292, 372 
Harmonic oscillator, 55 
Heisenberg’s equation, 3,294 
Heisenberg’s picture, 293 
Heisenberg’s uncertainty principle, 8 
Hellinger and Toeplitz theorem, 195 
Hilbert spaces, 30, 32,33,41,109,220 
equipped, 498 
extended, 186 

Hilbert tensor products of, 144,145 
isometric transformation of, 212 
rigged, 186 


I 

Idempotent, 200 
Identification operator, 613 
incident flux, 426 
Indicial equation, 165 
Inertial frame, 4 
Inner product, 18 
Integral 
Bochner, 480 
Lebesgue, 90, 114 


Riemann-Stieltjes, 463 
improper, 464 
cross-iterated, 464 
spectral, 483 
time-ordered, 372 
Interacting state, 420 
Interaction picture, 298 
Intertwining property, see Wave operators 
Isometric operators and transformations, 
212 

Isomorphism of Euclidean spaces, 23 


K 

Kato’s theorem, 367 
Kernel 

Hilbert-Schmidt, 526 
L 2 , see Kernel Hilbert-Schmidt 
trace class, 526 
Kummer’s series, 165 


L 

Laboratory frame of reference, 119 
Laguerre functions, 170 
Laguerre polynomials, 170 
generating function of, 171 
Lebesgue bounded (dominated) convergence 
theorem, 287 

Lebesgue intergral, see Integral 
Lebesgue monotone convergence theorem, 
92 

Legendre functions, associated, 152 
Legendre polynomials, associated, 167 
Limit 

strong, see Operators, sequences 
uniform, see Operators, sequences 
weak, see Operators, sequences 
Limit point, 537, see also Accumulation 
point 

Weyl’s criterion for, 537 
Linear manifold, see Vector spaces 
Linear operator, see Operators 
Linear space, see Vector spaces 
Linear subspace, see Vector subspace 
Liouville equation, 397 
Liouville space, 405 
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Lippmann- Schwinger equations 
for eigenfunctions, 503,618 
in Hilbert space, 470 
perturbation solution of, 456 
solution-type, 472 
for Fob, 511 
two- Hilbert space ,615 


M 

Mapping 

Borel measurable, 115 
isometric, 27 
isomorphism, 175 
Matrix mechanics, 297 
Mean value theorem, 490 
Measurable space, 67 
simple, 81 
Measure, 58,67 
complex, 232 
continuous, 240 
extension of, 70 
finite, 67,79,232 
inner, 79 

Lebesgue, 67,78,79,114 
Lebesgue-Stieltjes, 79 
lower variation of, 239 
outer, 71 

positive operator-valued, 406 
probability, 68 
signed, 236,237 
spectral, 231 
upper variation, 239 
Measure spaces, 76 
Cartesian product of, 77 
Measurement 
determinative, 6,262,424 
Bom’s correspondence rule for, 262 
preparatory, 6,264,266 
theory of, 4,262, 264 
Metric, 25 
Metric space, 25 
complete, 26 
completion, 27 
densely embedded, 27 
Miller wave operators, 421,603 
Momentum representation, 354 
Monotone class of sets, 63 
Monotonic sequence of operators, 

242 


N 

Neighborhood, 27 
Norm, 20 

completeness in, 30 
convergence in, 30 
Hilbert-Schmidt, 384 
Rollnik, 545 
trace norm, 386 


O 

Observables, 2 
compatible, 260,262 
complete set of, 316,498 
expectation value, 277 
fundamental, 270 
Operators 
adjoint, 188 

antilinear, 173, see also Transformations, 
antilinear 

asymptotically compensating, 515 
bound of, 178 
bounded,242 
from above, 535 
from below, 535 
relative to, 360 
canonical form, 379 
canonically conjugate, 329 
closed, 191 
compact, 375 
complete set of, 315 
completely continuous, 375 
core, 366 

density, see Statistical operator 
essentially self-adjoint, 358 
extension, 186 
finite-rank, 377 
function of, 272 
graph of, 191 
Hermitian, 127 
Hilbert-Schmidt, 383 
hypermaximal, see Self-adjoint operators 
linear, 173, 186, 192, see also 
Transformations, linear 
closure of, 355 

matrix representation of, 175,176 
monotone sequences of, 242 
positive definite, 381 
projection, see Projector 
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relative bound, 361 
restriction, 186 

self-adjoint, see Self-adjoint operators 
sequences 

strong limit of, 230, 231 
uniform limit of, 230 
weak limit of, 230, 231 
symmetric, 192 
Optical theorem, 513 
generalized, 513 
Orthogonal system, 21 
Orthonormal basis, 36 


P 

Parseval’s relation, 38 
Partially isometric operators, 440 
Pauli’s exclusion principle, 168, 307 
Phase shifts, 497, 589, 593 
Phase space, 2 
Physical equivalence, 293 
Physical interpretation, see Measurement, 
theory of 

Planck constant, 46 
Plane waves 
advanced distorted, 495 
distorted, 494 
free, 491 

incoming distorted, 495 
outgoing distorted, 495 
retarded distorted, see Plane waves, 
outgoing distorted 
Potential 
central, 149 
long-range, 514 
spherically symmetric, 149 
Pre-Hilbert space, 18 

Principle of indistinguishability of identical 
particles, 307 
Probability, 46 

amptitude, see Transition, amplitude 
current, 410,411 
frequency interpretation, 47 
Projector, 197 


Q 

Quantum number 
azimuthal, 168 


magnetic, 168 
principal, 168 
Quotient space, 138 


R 

R operator, 511 

Radon- Nykodim theoreom, 269 
Rellich’s theorem, 366 
Resolution of identity, 235 
Resolvent equations 
first, 478 
second, 478 

Resolvent of operators, 475 
Resolvent set, 475 
Riemann-Lebesgue lemma, 216 
Riemann-Stieltjes sums, 464 
Riesz- Fischer theorem, 105 
Riesz theorem, 184 
Ring, Boolean, 66 
Rodrigues’ formula, 152 
Rollnik class, 545 
Rollnik norm, 545 


S 

S matrix, 512, see also Scattering, matrix 
partial-wave, 497 
S operator, 423,438 
for multichannel scattering, 607, 613 
perturbation expansion of, 450 
unitarity of, 442 

Scalar product, see Inner product 
Scattering 

amplitude, 494, 553, 556 
elastic, 597 
experiments, 416,425 
inelastic, 598 
matrix, 512 

operator, see S operator 
state, 421 

superoperator, 487 
theory 

stationary, 458 
three-body, 622 
time dependent, 438 
time independent, see Scattering, 
theory, stationary 
Schmidt procedure, see Vectors 



684 


Index 


Schroedinger equation, 3,46,48,122,332 
Schroedinger operator, 355 
essential self-adjointness of, 358, 368 
form, 122 
n-body, 536 

Schroedinger picture, 291 
Schwarz - Cauchy inequality, 19, 20 
Self-adjoint operators, 192 
commuting, 258,259,261 
spectral measure of, 250 
Sequence 
Cauchy,26 

contracting, see Sequence, monotonically 
decreasing 

convergence in mean, 105 
expanding, see Sequence, monotonically 
increasing 

fundamental, see also Sequence, Cauchy 
in mean, 104 

monotonically decreasing, 63 
monotonically increasing, 63,209 
of operators, see Operators, sequences 
Sesquilinear forms, see Bilinear form 
Set function, 58 
continuous, 68 
measurable, 80 
monotone, 79 
subtractive, 79 
Sets 

additive class of, see Ring, Boolean 
Borel measurable, 80 
closed, 27 

Lebesgue measurable, 80 
measurable, 65,71,74 
monotone class of, 63 
partially ordered, 203 
Shift operator, 213 

Spectral decomposition of operators, 242, 
252 

Spectral family, see Spectral function 
Spectral function, 235 
Spectral integrals, 483 
Spectral measure, 231 
Spectral representation space of operators, 
315 

Spectrum of operators 
absolutely continuous, 438 
continuous, 253,475 
discrete, 537 
essential, 537 
point, 49,227,253,475 


residual, 475 
simple, 315,447 
singularly continuous, 552 
Spherical wave, outgoing free, 496 
Spin 

measurement of, 9 
Stem- Gerlach experiment, 9 
State vectors, 301 
States, 2 

Statistical operator, 269 

Stem- Gerlach experiment, see Spin 

Stochastic phase space, 410 

Stochastic value, 410 

Stone’s theorem, 335 

Subspaces 

orthogonal complements of, 199 
orthogonal sums of, 199,200 
Superoperator, 398 
Super selection rules, 301, 302 


T 

T matrix, 508, 620, see also Transition, 
matrix 

operators, 511 
Tensor product, 303 
algebraic, 141 
antisymmetric, 306 
Hilbert, 144 
symmetric, 306 
Threshold, 598 
Time evolution, 287, 292 
Tonelli’s theorem, 98 
Trace of an operator, 380 
class, 380 
Transform 
Cayley, 219,223 
Fourier, 218 

Fourier-Plancherel, 219, 224 
Transformations 
antilinear, 172 
bounded,178 
continuous, 178 
isometric, 212 
linear, 172 

semilinear, see Transformations, 
antilinear 
unitary, 212 

of vector spaces, 172, see also Operators 
Transformer, 398 
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Transition 
amplitude, 472 
matrix, 506, 510 
operator, 423,474, 510 
probability, 268,472 
superoperator, 488 
Translation, 576 
Trotter’s product formula, 585 


U 

Unitary equivalence, 332, see also 
Isomorphism of Euclidean spaces 
Unitary operator, 212 
Unitary space, see Euclidean spaces 

V 

Vector basis, 15 
Vector spaces, 11, 36 
algebraic tensor products of, 141 
conjugate, see Vector spaces, dual 
dimension of, 14 
dual, 183 

functional on, 182 

inner product, see Euclidean spaces 
isomorphism of, 16 
Vector subspace, 16 
Vectors 
bra, 186,190 
ket, 186,190 
linearly independent, 14 
normalization of, 21 
orthogonal, 21 
orthogonal system of, 21 


orthonormalization procedure of Schmidt 
(or Gram-Schmidt), 22 
span of, 15, 36 

von Neumann’s equation, 396 
von Neumann’s theorem, 342 


W 

Wave 

advanced distorted, see Wave, incoming 
distorted 
free, 491,496 

incoming distorted, 495,496 
outgoing distorted, 495, 496 
retarded distorted, see Wave, outgoing 
distorted 

Wave function, 45,120 
Wave operators 
channel, 603 
complete, 552 
existence of, 438 
incoming, 423 

intertwining properties of, 439, 603 
modified, 514 
outgoing, 423 
renormalized, 514 
Wave packet, evanescence of, 596 
Wave superoperators, 487 
Weierstrass approximation theorem, 

157 

Weyl relations, 333 


Y 

Yukawa interaction, 365 
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